Search | BVS - Ministry of Health

1.

AllergenAI: a deep learning model predicting allergenicity based on protein sequence.

Yang, Chengyuan; Negi, Surendra S; Schein, Catherine H; Braun, Werner; Kim, Pora.

bioRxiv ; 2024 Jun 27.

Article in English | MEDLINE | ID: mdl-38979176

ABSTRACT

Innovations in protein engineering can help redesign allergenic proteins to reduce adverse reactions in sensitive individuals. To accomplish this aim, a better knowledge of the molecular properties of allergenic proteins and the molecular features that make a protein allergenic is needed. We present a novel AI-based tool, AllergenAI, to quantify the allergenic potential of a given protein. Our approach is solely based on protein sequences, differentiating it from previous tools that use some knowledge of the allergens' physicochemical and other properties in addition to sequence homology. We used the collected data on protein sequences of allergenic proteins as archived in the three well-established databases, SDAP 2.0, COMPARE, and AlgPred 2, to train a convolutional neural network and assessed its prediction performance by cross-validation. We then used AllergenAI to find novel potential proteins of the cupin family in date palm, spinach, maize, and red clover plants with a high allergenicity score that might have an adverse allergenic effect on sensitive individuals. By analyzing the feature importance scores (FIS) of vicilins, we identified a proline-alanine-rich (P-A) motif in the top 50% of FIS regions that overlapped with known IgE epitope regions of vicilin allergens. Furthermore, using ~1600 allergen structures in our SDAP database, we showed the potential to incorporate 3D information in a CNN model. In the future, incorporating 3D information in the training data should enhance accuracy. AllergenAI is a novel foundation for identifying the critical features that distinguish allergenic proteins.
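
As a rough illustration of what "solely based on protein sequences" can mean for a convolutional network, the sketch below one-hot encodes a sequence into a fixed-size matrix. The abstract does not state how AllergenAI encodes its input, so the alphabet order, the padding length, and the function name `one_hot_encode` are all assumptions made for this example.

```python
# Hypothetical sketch: one-hot encoding a protein sequence as CNN input.
# The alphabet order and padding length are assumptions, not AllergenAI's settings.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot_encode(sequence, max_len):
    """Return a max_len x 20 matrix (list of lists) for a protein sequence.

    Sequences longer than max_len are truncated; shorter ones are padded
    with all-zero rows. Non-standard residues (e.g. 'X') also map to zeros.
    """
    matrix = [[0] * len(AMINO_ACIDS) for _ in range(max_len)]
    for pos, residue in enumerate(sequence.upper()[:max_len]):
        idx = AA_INDEX.get(residue)
        if idx is not None:
            matrix[pos][idx] = 1
    return matrix


# Toy input: four residues, one of them non-standard, padded to length 6.
encoded = one_hot_encode("MPAX", max_len=6)
```

Each row of `encoded` corresponds to one sequence position, which is the layout a 1D convolution would slide over.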

2.

Radiogenomics-Based Risk Prediction of Glioblastoma Multiforme with Clinical Relevance.

Qian, Xiaohua; Tan, Hua; Liu, Xiaona; Zhao, Weiling; Chan, Michael D; Kim, Pora; Zhou, Xiaobo.

Genes (Basel) ; 15(6), 2024 Jun 01.

Article in English | MEDLINE | ID: mdl-38927654

ABSTRACT

Glioblastoma multiforme (GBM) is the most common and aggressive primary brain tumor. Although temozolomide (TMZ)-based radiochemotherapy improves overall GBM patients' survival, it also increases the frequency of false positive post-treatment magnetic resonance imaging (MRI) assessments for tumor progression. Pseudo-progression (PsP) is a treatment-related reaction with an increased contrast-enhancing lesion size at the tumor site or resection margins mimicking tumor recurrence on MRI. The accurate and reliable prognostication of GBM progression is urgently needed in the clinical management of GBM patients. Clinical data analysis indicates that the patients with PsP had superior overall and progression-free survival rates. In this study, we aimed to develop a prognostic model to evaluate the tumor progression potential of GBM patients following standard therapies. We applied a dictionary learning scheme to obtain imaging features of GBM patients with PsP or true tumor progression (TTP) from the Wake dataset. Based on these radiographic features, we conducted a radiogenomics analysis to identify the significantly associated genes. These significantly associated genes were used as features to construct a 2YS (2-year survival rate) logistic regression model. GBM patients were classified into low- and high-survival risk groups based on the individual 2YS scores derived from this model. We tested our model using an independent dataset from The Cancer Genome Atlas Program (TCGA) and found that 2YS scores were significantly associated with the patient's overall survival. We used two cohorts of the TCGA data to train and test our model. Our results show that the 2YS score-based classification results from the training and testing TCGA datasets were significantly associated with the overall survival of patients. We also analyzed the survival prediction ability of other clinical factors (gender, age, KPS (Karnofsky performance status), normal cell ratio) and found that these factors were unrelated or weakly correlated with patients' survival. Overall, our studies have demonstrated the effectiveness and robustness of the 2YS model in predicting the clinical outcomes of GBM patients after standard therapies.
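
The scoring step described above (a logistic regression score followed by a split into risk groups) can be sketched as follows. This is only a minimal illustration: the weights, bias, feature values, the 0.5 cutoff, and the assumption that a higher 2YS score means lower risk are all hypothetical and are not taken from the paper.

```python
import math


def two_year_survival_score(features, weights, bias):
    """Logistic regression score in (0, 1) from gene-level features.

    `weights` and `bias` stand in for fitted coefficients; the values used
    below are made up for illustration.
    """
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


def risk_group(score, cutoff=0.5):
    """Assign a risk group; cutoff and direction are assumptions of this sketch."""
    return "low-risk" if score >= cutoff else "high-risk"


# Hypothetical coefficients for two gene features.
weights = [1.0, -2.0]
score_a = two_year_survival_score([2.0, 0.5], weights, bias=0.0)   # z = 1.0
score_b = two_year_survival_score([0.0, 1.5], weights, bias=0.0)   # z = -3.0
groups = [risk_group(score_a), risk_group(score_b)]
```

In practice the coefficients would be estimated on a training cohort and the cutoff chosen before evaluating on an independent cohort.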

Subjects

Brain Neoplasms , Glioblastoma , Magnetic Resonance Imaging , Humans , Glioblastoma/genetics , Glioblastoma/diagnostic imaging , Glioblastoma/pathology , Glioblastoma/mortality , Brain Neoplasms/genetics , Brain Neoplasms/diagnostic imaging , Brain Neoplasms/pathology , Brain Neoplasms/mortality , Male , Female , Magnetic Resonance Imaging/methods , Middle Aged , Prognosis , Adult , Aged , Disease Progression , Temozolomide/therapeutic use , Genomics/methods , Survival Rate , Clinical Relevance

3.

FusionNW, a potential clinical impact assessment of kinases in pan-cancer fusion gene network.

Yang, Chengyuan; Kumar, Himansu; Kim, Pora.

Brief Bioinform ; 25(2), 2024 Jan 22.

Article in English | MEDLINE | ID: mdl-38493341

ABSTRACT

Kinase fusion genes are the most active fusion gene group in human cancer fusion genes. To help choose the clinically significant kinase so that the cancer patients that have fusion genes can be better diagnosed, we need a metric to infer the assessment of kinases in pan-cancer fusion genes rather than relying on the sample frequency of expressed fusion genes. Most of all, multiple studies assessed human kinases as drug targets using multiple types of genomic and clinical information, but none used the kinase fusion genes in their study. The assessment studies of kinases without kinase fusion gene events can miss the effect of one of the mechanisms that enhance the kinase function in cancer. To fill this gap, in this study, we suggest a novel way of assessing genes using a network propagation approach to infer how likely individual kinases are to influence the kinase fusion gene network composed of ~5K kinase fusion gene pairs. To select a better seed of propagation, we chose the top genes via dimensionality reduction, such as a principal component or latent layer information of six features of individual genes in pan-cancer fusion genes. Our approach may provide a novel way to assess human kinases in cancer.
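
The abstract does not say which propagation algorithm FusionNW uses. A common choice for network propagation is a random walk with restart, sketched below on a toy graph; the node names, the restart probability, and the function name `propagate` are assumptions of this example, and the seed set is simply given rather than derived by dimensionality reduction.

```python
def propagate(adjacency, seeds, restart=0.5, tol=1e-9, max_iter=1000):
    """Random walk with restart on an undirected graph.

    adjacency: dict mapping each node to the set of its neighbours
               (every neighbour must itself be a key of the dict)
    seeds:     set of nodes at which the walk restarts
    Returns a dict of steady-state scores that sum to 1.
    """
    nodes = sorted(adjacency)
    p0 = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    p = dict(p0)
    for _ in range(max_iter):
        # Restart mass goes back to the seeds.
        nxt = {n: restart * p0[n] for n in nodes}
        for n in nodes:
            neighbours = adjacency[n]
            if not neighbours:
                # An isolated node keeps its own walking mass.
                nxt[n] += (1.0 - restart) * p[n]
                continue
            # Remaining mass is split evenly among the neighbours.
            share = (1.0 - restart) * p[n] / len(neighbours)
            for m in neighbours:
                nxt[m] += share
        delta = sum(abs(nxt[n] - p[n]) for n in nodes)
        p = nxt
        if delta < tol:
            break
    return p


# Toy fusion-gene network with hypothetical gene labels: A-B, A-C, C-D.
toy_network = {
    "A": {"B", "C"},
    "B": {"A"},
    "C": {"A", "D"},
    "D": {"C"},
}
scores = propagate(toy_network, seeds={"A"})
```

Nodes closer to the seed, or reachable through more paths, end up with higher scores, which is the sense in which propagation ranks genes by their influence on the network.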

Subjects

Gene Regulatory Networks , Neoplasms , Humans , Neoplasms/genetics , Gene Fusion

4.

Systematic characterization of protein structural features of alternative splicing isoforms using AlphaFold 2.

Yang, Yuntao; Xie, Yuhan; Li, Zhao; Diala, Chiamaka; Ali, Meer; Li, Rongbin; Xu, Yi; Wu, Albon; Kim, Pora; Hosseini, Sayed-Rzgar; Bi, Erfei; Zhao, Hongyu; Zheng, W Jim.

bioRxiv ; 2024 Feb 19.

Article in English | MEDLINE | ID: mdl-38464054

ABSTRACT

Alternative splicing is an important cellular process in eukaryotes, altering pre-mRNA to yield multiple protein isoforms from a single gene. However, our understanding of the impact of alternative splicing events on protein structures is currently constrained by a lack of sufficient protein structural data. To address this limitation, we employed AlphaFold 2, a cutting-edge protein structure prediction tool, to conduct a comprehensive analysis of alternative splicing for approximately 3,000 human genes, providing valuable insights into its impact on protein structure. Our investigation employed state-of-the-art high-performance computing infrastructure to systematically characterize structural features in alternatively spliced regions and identified changes in protein structure following alternative splicing events. Notably, we found that alternative splicing tends to alter the structure of residues primarily located in coils and beta-sheets. Our research highlighted a significant enrichment of loops and highly exposed residues within human alternatively spliced regions. Specifically, our examination of the Septin-9 protein revealed potential associations between loops and alternative splicing, providing insights into its evolutionary role. Furthermore, our analysis uncovered two missense mutations in the Tau protein that could influence alternative splicing, potentially contributing to the pathogenesis of Alzheimer's disease. In summary, our work, through a thorough statistical analysis of extensive protein structural data, sheds new light on the intricate relationship between alternative splicing, evolution, and human disease.

5.

Study of prognostic splicing factors in cancer using machine learning approaches.

Yang, Mengyuan; Liu, Jiajia; Kim, Pora; Zhou, Xiaobo.

Hum Mol Genet ; 33(13): 1131-1141, 2024 Jun 21.

Article in English | MEDLINE | ID: mdl-38538560

ABSTRACT

Splicing factors (SFs) are the major RNA-binding proteins (RBPs) and key molecules that regulate the splicing of mRNA molecules through binding to mRNAs. The expression of splicing factors is frequently deregulated in different cancer types, causing the generation of oncogenic proteins involved in cancer hallmarks. In this study, we investigated the genes that encode RNA-binding proteins and identified potential splicing factors that contribute to aberrant splicing by applying a random forest classification model. The results suggested that 56 splicing factors were related to the prognosis of 13 cancers, two SF complexes in liver hepatocellular carcinoma, and one SF complex in esophageal carcinoma. Further systematic bioinformatics studies on these cancer prognostic splicing factors and their related alternative splicing events revealed the potential regulations in a cancer-specific manner. Our analysis found that high ILF2-ILF3 expression correlates with poor prognosis in LIHC through alternative splicing. These findings emphasize the importance of SFs as potential indicators for prognosis or targets for therapeutic interventions. Their roles in cancer exhibit complexity and are contingent upon the specific context in which they operate. This recognition further underscores the need for a comprehensive understanding and exploration of the role of SFs in different types of cancer, paving the way for their potential utilization in prognostic assessments and the development of targeted therapies.

Subjects

Alternative Splicing , Computational Biology , Gene Expression Regulation, Neoplastic , Machine Learning , Neoplasms , RNA Splicing Factors , Humans , RNA Splicing Factors/genetics , RNA Splicing Factors/metabolism , Prognosis , Alternative Splicing/genetics , Neoplasms/genetics , Computational Biology/methods , RNA-Binding Proteins/genetics , RNA-Binding Proteins/metabolism , RNA Splicing/genetics , Biomarkers, Tumor/genetics , Biomarkers, Tumor/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Liver Neoplasms/genetics , Carcinoma, Hepatocellular/genetics

6.

StemDriver: a knowledgebase of gene functions for hematopoietic stem cell fate determination.

Luo, Yangyang; Guo, Jingjing; Wen, Jianguo; Zhao, Weiling; Huang, Kexin; Liu, Yang; Wang, Grant; Luo, Ruihan; Niu, Ting; Feng, Yuzhou; Xu, Haixia; Kim, Pora; Zhou, Xiaobo.

Nucleic Acids Res ; 52(D1): D1042-D1052, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37953308

ABSTRACT

StemDriver is a comprehensive knowledgebase dedicated to the functional annotation of genes participating in the determination of hematopoietic stem cell fate, available at http://biomedbdc.wchscu.cn/StemDriver/. By utilizing single-cell RNA sequencing data, StemDriver has successfully assembled a comprehensive lineage map of hematopoiesis, capturing the entire continuum from the initial formation of hematopoietic stem cells to the fully developed mature cells. Extensive exploration and characterization were conducted on gene expression features corresponding to each lineage commitment. In the current version, StemDriver integrates data from 42 studies, encompassing a diverse range of 14 tissue types spanning from the embryonic phase to adulthood. In order to ensure uniformity and reliability, all data undergo a standardized pipeline, which includes quality data pre-processing, cell type annotation, differential gene expression analysis, identification of gene categories correlated with differentiation, analysis of highly variable genes along pseudo-time, and exploration of gene expression regulatory networks. In total, StemDriver assessed the function of 23 839 genes for human samples and 29 533 genes for mouse samples. StemDriver also provides users with reference datasets and models for cell annotation. We believe that StemDriver will offer valuable assistance to research focused on cellular development and hematopoiesis.

Subjects

Hematopoiesis , Hematopoietic Stem Cells , Animals , Humans , Mice , Gene Regulatory Networks , Hematopoiesis/genetics , Hematopoietic Stem Cells/metabolism , Reproducibility of Results , Knowledge Bases , Cell Lineage

7.

COV2Var, a function annotation database of SARS-CoV-2 genetic variation.

Feng, Yuzhou; Yi, Jiahao; Yang, Lin; Wang, Yanfei; Wen, Jianguo; Zhao, Weiling; Kim, Pora; Zhou, Xiaobo.

Nucleic Acids Res ; 52(D1): D701-D713, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37897356

ABSTRACT

The COVID-19 pandemic, caused by the coronavirus SARS-CoV-2, has resulted in the loss of millions of lives and severe global economic consequences. Every time SARS-CoV-2 replicates, the viruses acquire new mutations in their genomes. Mutations in SARS-CoV-2 genomes have led to increased transmissibility, severe disease outcomes, evasion of the immune response, changes in clinical manifestations and reduced efficacy of vaccines or treatments. To date, multiple resources provide lists of detected mutations without key functional annotations. There is a lack of research examining the relationship between mutations and various factors such as disease severity, pathogenicity, patient age, patient gender, cross-species transmission, viral immune escape, immune response level, viral transmission capability, viral evolution, host adaptability, viral protein structure, viral protein function, viral protein stability and concurrent mutations. A deep understanding of the relationship between mutation sites and these factors is crucial for advancing our knowledge of SARS-CoV-2 and for developing effective responses. To fill this gap, we built COV2Var, a function annotation database of SARS-CoV-2 genetic variation, available at http://biomedbdc.wchscu.cn/COV2Var/. COV2Var aims to identify common mutations in SARS-CoV-2 variants and assess their effects, providing a valuable resource for intensive functional annotations of common mutations among SARS-CoV-2 variants.

Subjects

Databases, Genetic , SARS-CoV-2 , Humans , Mutation , SARS-CoV-2/genetics , Molecular Sequence Annotation , Genetic Variation

8.

FusionNeoAntigen: a resource of fusion gene-specific neoantigens.

Kumar, Himansu; Luo, Ruihan; Wen, Jianguo; Yang, Chengyuan; Zhou, Xiaobo; Kim, Pora.

Nucleic Acids Res ; 52(D1): D1276-D1288, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37870454

ABSTRACT

Among the diverse sources of neoantigens (i.e. single-nucleotide variants (SNVs), insertions or deletions (Indels) and fusion genes), fusion gene-derived neoantigens are generally more immunogenic, have multiple targets per mutation and are more widely distributed across various cancer types. Therefore, fusion gene-derived neoantigens are a potential source of highly immunogenic neoantigens and hold great promise for cancer immunotherapy. However, the lack of fusion protein sequence resources and knowledge prevents this application. We introduce 'FusionNeoAntigen', a dedicated resource for fusion-specific neoantigens, accessible at https://compbio.uth.edu/FusionNeoAntigen. In this resource, we provide fusion gene breakpoint crossing neoantigens focused on ~43K fusion proteins of ~16K in-frame fusion genes from FusionGDB2.0. FusionNeoAntigen provides fusion gene information, corresponding fusion protein sequences, fusion breakpoint peptide sequences, fusion gene-derived neoantigen prediction, virtual screening between fusion breakpoint peptides having potential fusion neoantigens and human leucocyte antigens (HLAs), fusion breakpoint RNA/protein sequences for developing vaccines, information on samples with fusion-specific neoantigen, potential CAR-T targetable cell-surface fusion proteins and literature curation. FusionNeoAntigen will help to develop fusion gene-based immunotherapies. We will report all potential fusion-specific neoantigens from all possible open reading frames of ~120K human fusion genes in future versions.

Subjects

Antigens, Neoplasm , Databases, Genetic , Neoplasms , Oncogene Proteins, Fusion , Humans , Antigens, Neoplasm/genetics , HLA Antigens , INDEL Mutation , Mutation , Neoplasms/genetics , Oncogene Proteins, Fusion/genetics

9.

FusionPDB: a knowledgebase of human fusion proteins.

Kumar, Himansu; Tang, Lin-Ya; Yang, Chengyuan; Kim, Pora.

Nucleic Acids Res ; 52(D1): D1289-D1304, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37870473

ABSTRACT

Tumorigenic functions due to the formation of fusion genes have been targeted for cancer therapeutics (i.e. kinase inhibitors). However, many fusion proteins involved in various cellular processes have not been studied for targeted therapeutics. This is because the lack of complete fusion protein sequences and their whole 3D structures has made it challenging to develop new therapeutic strategies. To fill these critical gaps, we developed a computational pipeline and a resource of human fusion proteins named FusionPDB, available at https://compbio.uth.edu/FusionPDB. FusionPDB is organized into four levels: 43K fusion protein sequences (14.7K in-frame fusion genes, Level 1), over 2300 + 1267 fusion protein 3D structures (from 2300 recurrent and 266 manually curated in-frame fusion genes, Level 2), pLDDT score analysis for the 1267 fusion proteins from 266 manually curated fusion genes (Level 3), and virtual screening outcomes for 68 selected fusion proteins from 266 manually curated fusion genes (Level 4). FusionPDB is the only resource providing whole 3D structures of fusion proteins and comprehensive knowledge of human fusion proteins. It will be regularly updated until it covers all human fusion proteins in the future.

Subjects

Databases, Protein , Humans , Amino Acid Sequence , Knowledge Bases , Neoplasms/genetics , Protein Conformation

10.

AgeAnnoMO: a knowledgebase of multi-omics annotation for animal aging.

Huang, Kexin; Liu, Xi; Zhang, Zhaocan; Wang, Tiangang; Xu, Haixia; Li, Qingxuan; Jia, Yuhao; Huang, Liyu; Kim, Pora; Zhou, Xiaobo.

Nucleic Acids Res ; 52(D1): D822-D834, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37850649

ABSTRACT

Aging entails gradual functional decline influenced by interconnected factors. Multiple hallmarks have been proposed as common and conserved underlying denominators of aging on the molecular, cellular and systemic levels across multiple species. Thus, understanding the function of aging hallmarks and their relationships across species can facilitate the translation of anti-aging drug development from model organisms to humans. Here, we built AgeAnnoMO (https://relab.xidian.edu.cn/AgeAnnoMO/#/), a knowledgebase of multi-omics annotation for animal aging. AgeAnnoMO encompasses an extensive collection of 136 datasets from eight modalities, covering 8596 samples from 50 representative species, making it a comprehensive resource for aging and longevity research. AgeAnnoMO characterizes multiple aging regulators across species via multi-omics data, comprehensively annotating aging-related genes, proteins, metabolites, mitochondrial genes, microbiotas and age-specific TCR and BCR sequences tied to aging hallmarks for these species and tissues. AgeAnnoMO not only facilitates a deeper and more generalizable understanding of aging mechanisms, but also provides potential insights into the specificity across tissues and species in the aging process, which is important for developing effective anti-aging interventions for diverse populations. We anticipate that AgeAnnoMO will provide a valuable resource for comprehending and integrating the conserved driving hallmarks in aging biology and identifying the targetable biomarkers for aging research.

Subjects

Aging , Knowledge Bases , Multiomics , Animals , Humans , Aging/genetics , Biomarkers , Longevity/genetics

11.

DRMref: comprehensive reference map of drug resistance mechanisms in human cancer.

Liu, Xiaona; Yi, Jiahao; Li, Tina; Wen, Jianguo; Huang, Kexin; Liu, Jiajia; Wang, Grant; Kim, Pora; Song, Qianqian; Zhou, Xiaobo.

Nucleic Acids Res ; 52(D1): D1253-D1264, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37986230

ABSTRACT

Drug resistance poses a significant challenge in cancer treatment. Despite the initial effectiveness of therapies such as chemotherapy, targeted therapy and immunotherapy, many patients eventually develop resistance. To gain deep insights into the underlying mechanisms, single-cell profiling has been performed to interrogate drug resistance at the cell level. Herein, we have built the DRMref database (https://ccsm.uth.edu/DRMref/) to provide comprehensive characterization of drug resistance using single-cell data from drug treatment settings. The current version of DRMref includes 42 single-cell datasets from 30 studies, covering 382 samples, 13 major cancer types, 26 cancer subtypes, 35 treatment regimens and 42 drugs. All datasets in DRMref are browsable and searchable, with detailed annotations provided. Meanwhile, DRMref includes analyses of cellular composition, intratumoral heterogeneity, epithelial-mesenchymal transition, cell-cell interaction and differentially expressed genes in resistant cells. Notably, DRMref investigates the drug resistance mechanisms (e.g. Aberration of Drug's Therapeutic Target, Drug Inactivation by Structure Modification, etc.) in resistant cells. Additional enrichment analysis of hallmark/KEGG (Kyoto Encyclopedia of Genes and Genomes)/GO (Gene Ontology) pathways, as well as the identification of microRNAs, motifs and transcription factors involved in resistant cells, is provided in DRMref for users' exploration. Overall, DRMref serves as a unique single-cell-based resource for studying drug resistance and drug combination therapy and for discovering novel drug targets.

Subjects

Databases, Factual , Drug Resistance , MicroRNAs , Neoplasms , Humans , Drug Resistance/genetics , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , MicroRNAs/genetics , Neoplasms/drug therapy , Neoplasms/genetics , Internet

12.

Systematic investigation of the homology sequences around the human fusion gene breakpoints in pan-cancer - bioinformatics study for a potential link to MMEJ.

Kim, Pora; Kumar, Himansu; Yang, Chengyuan; Luo, Ruihan; Liu, Jiajia; Zhou, Xiaobo.

Brief Bioinform ; 24(5), 2023 09 20.

Article in English | MEDLINE | ID: mdl-37635381

ABSTRACT

Microhomology-mediated end joining (MMEJ), an error-prone DNA damage repair mechanism, frequently leads to chromosomal rearrangements due to its ability to engage in promiscuous end joining of genomic instability and also leads to increasing mutational load at the sequences flanking the breakpoints (BPs). In this study, we systematically investigated the homology sequences around the genomic breakpoint area of human fusion genes, which were formed by the chromosomal rearrangements initiated by DNA double-strand breakage. Since RNA-seq data is the typical data set used to check fusion genes, for the known exon junction fusion breakpoints identified from RNA-seq data, we have to infer the high chance of genomic breakpoint regions. For this, we utilized the high feature importance score area calculated from our recently developed fusion BP prediction model, FusionAI, and identified 151K microhomologies among ~24K fusion BPs in 20K fusion genes. From our multiple bioinformatics studies, we found a relationship between sequence homologies and the immune system. This in-silico study will provide novel knowledge on the sequence homologies around the coded structural variants.

Subjects

Computational Biology , Neoplasms , Humans , Genomics , Neoplasms/genetics , Exons , Genomic Instability

13.

COVIDanno, COVID-19 annotation in human.

Feng, Yuzhou; Yang, Mengyuan; Fan, Zhiwei; Zhao, Weiling; Kim, Pora; Zhou, Xiaobo.

Front Microbiol ; 14: 1129103, 2023.

Article in English | MEDLINE | ID: mdl-37497545

ABSTRACT

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the etiologic agent of coronavirus disease 2019 (COVID-19), has caused a global health crisis. Despite ongoing efforts to treat patients, there is no universal prevention or cure available. One feasible approach is to identify the key genes from SARS-CoV-2-infected cells. The SARS-CoV-2-infected in vitro model allows easy control of the experimental conditions, reproducible results, and monitoring of infection progression. Currently, accumulating RNA-seq data from SARS-CoV-2 in vitro models urgently needs systematic translation and interpretation. To fill this gap, we built COVIDanno, COVID-19 annotation in humans, available at http://biomedbdc.wchscu.cn/COVIDanno/. The aim of this resource is to provide a reference resource of intensive functional annotations of differentially expressed genes (DEGs) among different time points of COVID-19 infection in human in vitro models. To do this, we performed differential expression analysis for 136 individual datasets across 13 tissue types. In total, we identified 4,935 DEGs. We performed multiple bioinformatics/computational biology studies for these DEGs. Furthermore, we developed a novel tool to help users predict the status of SARS-CoV-2 infection for a given sample. COVIDanno will be a valuable resource for identifying SARS-CoV-2-related genes and understanding their potential functional roles in different time points and multiple tissue types.

14.

Computational design of DNA binding domain-retained fusion proteins and virtual screening against FDA-approved drugs.

Kumar, Himansu; Kim, Pora.

bioRxiv ; 2023 May 08.

Article in English | MEDLINE | ID: mdl-37214900

ABSTRACT

Even though transcription factors (TFs) are not regarded as good drug targets, mutated or dysregulated TFs can be a unique class of drug targets. Specifically, TF fusion proteins, which are translated structural variants that include TFs, may have downstream effects that promote tumorigenesis. To date, we lack the fusion protein sequence information and 3D structure information needed to identify potential drugs for fusion proteins. In this study, we predicted the 3D structures of 732 transcription factor fusion proteins (TFFPs). For the top five most frequent TFFPs, we performed virtual screening across the FDA-approved drugs. Our study will provide an initial platform to develop novel therapeutic targets in the transcription factor fusion proteins.

15.

The Potential Regulation of A-to-I RNA Editing on Genes in Parkinson's Disease.

Wu, Sijia; Xue, Qiuping; Qin, Xinyu; Wu, Xiaoming; Kim, Pora; Chyr, Jacqueline; Zhou, Xiaobo; Huang, Liyu.

Genes (Basel) ; 14(4), 2023 04 15.

Article in English | MEDLINE | ID: mdl-37107677

ABSTRACT

Parkinson's disease (PD) is characterized by dopaminergic neurodegeneration and an abnormal accumulation of α-synuclein aggregates. A number of genetic factors have been shown to increase the risk of PD. Exploring the underlying molecular mechanisms that mediate PD's transcriptomic diversity can help us understand neurodegenerative pathogenesis. In this study, we identified 9897 A-to-I RNA editing events associated with 6286 genes across 372 PD patients. Of them, 72 RNA editing events altered miRNA binding sites and this may directly affect miRNA regulations of their host genes. However, RNA editing effects on the miRNA regulation of genes are more complex. They can (1) abolish existing miRNA binding sites, which allows miRNAs to regulate other genes; (2) create new miRNA binding sites that may sequester miRNAs from regulating other genes; or (3) occur in the miRNA seed regions and change their targets. The first two processes are also referred to as miRNA competitive binding. In our study, we found 8 RNA editing events that may alter the expression of 1146 other genes via miRNA competition. We also found one RNA editing event that modified a miRNA seed region, which was predicted to disturb the regulation of four genes. Considering the PD-related functions of the affected genes, 25 A-to-I RNA editing biomarkers for PD are proposed, including the 3 editing events in the EIF2AK2, APOL6, and miR-4477b seed regions. These biomarkers may alter the miRNA regulation of 133 PD-related genes. All these analyses reveal the potential mechanisms and regulations of RNA editing in PD pathogenesis.
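
The first and third mechanisms listed above can be illustrated with a simplified seed-match model. The sketch assumes that a target site is a perfect complement to miRNA nucleotides 2-8 and that an edited adenosine (inosine) is read as guanosine; the sequences, positions, and function names are illustrative and are not taken from the study.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def seed_site(mirna):
    """Return the mRNA site complementary to the miRNA seed (nucleotides 2-8)."""
    seed = mirna[1:8]
    return "".join(COMPLEMENT[base] for base in reversed(seed))


def edit_a_to_i(rna, position):
    """Apply A-to-I editing at a 0-based position; inosine is written as G."""
    if rna[position] != "A":
        raise ValueError("A-to-I editing requires an adenosine at this position")
    return rna[:position] + "G" + rna[position + 1:]


def has_site(utr, mirna):
    """True if the 3'UTR contains a perfect seed match for the miRNA."""
    return seed_site(mirna) in utr


# Illustrative sequences only (not from the paper).
MIRNA = "UGAGGUAGUAGGUUGUAUAGUU"
UTR = "AAGCUACCUCAGG"

site_before = has_site(UTR, MIRNA)                   # site present
site_after = has_site(edit_a_to_i(UTR, 5), MIRNA)    # (1) editing abolishes the site
retargeted = seed_site(edit_a_to_i(MIRNA, 2))        # (3) seed editing changes the target
```

Real target prediction also weighs site context and pairing beyond the seed, so this only shows why a single edited base can switch a site on or off.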

Subjects

MicroRNAs , Parkinson Disease , Humans , Parkinson Disease/genetics , Parkinson Disease/metabolism , RNA Editing/genetics , MicroRNAs/genetics , MicroRNAs/metabolism , Gene Expression Profiling , Biomarkers/metabolism

16.

The Integrative Studies on the Functional A-to-I RNA Editing Events in Human Cancers.

Wu, Sijia; Fan, Zhiwei; Kim, Pora; Huang, Liyu; Zhou, Xiaobo.

Genomics Proteomics Bioinformatics ; 21(3): 619-631, 2023 Jun.

Article in English | MEDLINE | ID: mdl-36708807

ABSTRACT

Adenosine-to-inosine (A-to-I) RNA editing, constituting nearly 90% of all RNA editing events in humans, has been reported to contribute to tumorigenesis in diverse cancers. However, the comprehensive map for functional A-to-I RNA editing events in cancers is still insufficient. To fill this gap, we systematically and intensively analyzed multiple tumorigenic mechanisms of A-to-I RNA editing events in samples across 33 cancer types from The Cancer Genome Atlas. For each candidate among ~1,500,000 quantified RNA editing events, we performed diverse types of downstream functional annotations. Finally, we identified 24,236 potentially functional A-to-I RNA editing events, including the cases in APOL1, IGFBP3, GRIA2, BLCAP, and miR-589-3p. These events might play crucial roles in the scenarios of tumorigenesis, due to their tumor-related editing frequencies or probable effects on altered expression profiles, protein functions, splicing patterns, and microRNA regulations of tumor genes. Our functional A-to-I RNA editing events (https://ccsm.uth.edu/CAeditome/) will help better understand the cancer pathology from the A-to-I RNA editing aspect.

Subjects

MicroRNAs , Neoplasms , Humans , RNA Editing , Neoplasms/genetics , Neoplasms/pathology , MicroRNAs/genetics , MicroRNAs/metabolism , Carcinogenesis/genetics , Apolipoprotein L1/genetics , Apolipoprotein L1/metabolism

17.

Genetic control of RNA editing in neurodegenerative disease.

Wu, Sijia; Xue, Qiuping; Yang, Mengyuan; Wang, Yanfei; Kim, Pora; Zhou, Xiaobo; Huang, Liyu.

Brief Bioinform ; 24(2), 2023 03 19.

Article in English | MEDLINE | ID: mdl-36681936

ABSTRACT

A-to-I RNA editing diversifies the human transcriptome to confer its functional effects on the downstream genes or regulations, and is potentially involved in neurodegenerative pathogenesis. Its variabilities are attributed to multiple regulators, including the key factor of genetic variants. To comprehensively investigate the potentials of neurodegenerative disease-susceptibility variants from the view of A-to-I RNA editing, we analyzed matched genetic and transcriptomic data of 1596 samples across nine brain tissues and whole blood from two large consortiums, Accelerating Medicines Partnership-Alzheimer's Disease and Parkinson's Progression Markers Initiative. The large-scale and genome-wide identification of 95 198 RNA editing quantitative trait loci revealed the preferred genetic effects on adjacent editing events. Furthermore, to explore the underlying mechanisms of the genetic controls of A-to-I RNA editing, several top RNA-binding proteins were pointed out, such as EIF4A3, U2AF2, NOP58, FBL, NOP56 and DHX9, since their regulation of multiple RNA-editing events was probably interfered with by these genetic variants. Moreover, these variants may also contribute to the variability of other molecular phenotypes associated with RNA editing, including the functions of 3 proteins, expressions of 277 genes and splicing of 449 events. All the analysis results shown in NeuroEdQTL (https://relab.xidian.edu.cn/NeuroEdQTL/) constitute a unique resource for the understanding of neurodegenerative pathogenesis from genotypes to phenotypes related to A-to-I RNA editing.

Subjects

Neurodegenerative Diseases , Humans , Neurodegenerative Diseases/genetics , RNA Editing , Transcriptome , Gene Expression Profiling , Quantitative Trait Loci , Eukaryotic Initiation Factor-4A/genetics , DEAD-box RNA Helicases/genetics

18.

SPASCER: spatial transcriptomics annotation at single-cell resolution.

Fan, Zhiwei; Luo, Yangyang; Lu, Huifen; Wang, Tiangang; Feng, YuZhou; Zhao, Weiling; Kim, Pora; Zhou, Xiaobo.

Nucleic Acids Res ; 51(D1): D1138-D1149, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36243975

ABSTRACT

In recent years, the explosive growth of spatial technologies has enabled the characterization of spatial heterogeneity of tissue architectures. Compared to traditional sequencing, spatial transcriptomics preserves the spatial information of each captured location and provides novel insights into diverse spatially related biological contexts. Even though two spatial transcriptomics databases exist, they provide limited analytical information. Information such as the spatial heterogeneity of genes and cells, cell-cell communication activities in space, and the cell type compositions in the microenvironment provides critical clues to unveil the mechanisms of tumorigenesis and embryo differentiation. Therefore, we constructed a new spatial transcriptomics database, named SPASCER (https://ccsm.uth.edu/SPASCER), designed to help understand the heterogeneity of tissue organizations, region-specific microenvironments, and intercellular interactions across tissue architectures at multiple levels. SPASCER contains datasets from 43 studies, including 1082 sub-datasets from 16 organ types across four species. scRNA-seq data were integrated to deconvolve/map spatial transcriptomics, and the datasets were processed with spatial cell-cell interaction, gene pattern and pathway enrichment analyses. Cell-cell interaction and gene regulatory network analyses of scRNA-seq data from matched spatial transcriptomics were performed as well. The application of SPASCER will provide new insights into tissue architecture and a solid foundation for the mechanistic understanding of many biological processes in healthy and diseased tissues.

Subjects

Databases, Genetic , Gene Expression Profiling , Humans , Carcinogenesis , Cell Communication , Cell Differentiation , Single-Cell Analysis , Transcriptome , Tumor Microenvironment

19.

AgeAnno: a knowledgebase of single-cell annotation of aging in human.

Huang, Kexin; Gong, Hoaran; Guan, Jingjing; Zhang, Lingxiao; Hu, Changbao; Zhao, Weiling; Huang, Liyu; Zhang, Wei; Kim, Pora; Zhou, Xiaobo.

Nucleic Acids Res ; 51(D1): D805-D815, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36200838

ABSTRACT

Aging is a complex process that is accompanied by molecular and cellular alterations. The identification of tissue-/cell type-specific biomarkers of aging and elucidation of the detailed biological mechanisms of aging-related genes at the single-cell level can help to understand the heterogeneous aging process and design targeted anti-aging therapeutics. Here, we built AgeAnno (https://relab.xidian.edu.cn/AgeAnno/#/), a knowledgebase of single-cell annotation of aging in human, aiming to provide comprehensive characterizations for aging-related genes across diverse tissue-cell types in human by using single-cell RNA and ATAC sequencing data (scRNA and scATAC). The current version of AgeAnno houses 1 678 610 cells from 28 healthy tissue samples with ages ranging from 0 to 110 years. We collected 5580 aging-related genes from previous resources and performed dynamic functional annotations of the cellular context. For the scRNA data, we performed analyses including differential gene expression, gene variation coefficient, cell communication network, transcription factor (TF) regulatory network, and immune cell proportion. AgeAnno also provides differential chromatin accessibility analysis, motif/TF enrichment and footprint analysis, and co-accessibility peak analysis for scATAC data. AgeAnno will be a unique resource to systematically characterize aging-related genes across diverse tissue-cell types in human, and it could facilitate anti-aging and aging-related disease research.

Subjects

Aging , Knowledge Bases , Adolescent , Adult , Aged , Aged, 80 and over , Child , Child, Preschool , Humans , Infant , Infant, Newborn , Middle Aged , Young Adult , Chromatin/genetics , Single-Cell Analysis , Aging/genetics , Aging/pathology

20.

Phenotype-Genotype analysis of caucasian patients with high risk of osteoarthritis.

Wang, Yanfei; Chyr, Jacqueline; Kim, Pora; Zhao, Weiling; Zhou, Xiaobo.

Front Genet ; 13: 922658, 2022.

Article in English | MEDLINE | ID: mdl-36105105

ABSTRACT

Background: Osteoarthritis (OA) is a common cause of disability and pain around the world. Epidemiologic studies of family history have revealed evidence of genetic influence on OA. Although many efforts have been devoted to exploring genetic biomarkers, the mechanism behind this complex disease remains unclear. The identified genetic risk variants only explain a small proportion of the disease phenotype. Traditional genome-wide association study (GWAS) focuses on radiographic evidence of OA and excludes sex chromosome information in the analysis. However, gender differences in OA are multifactorial, with a higher frequency in women, indicating that chromosome X plays an essential role in OA pathology. Furthermore, the prevalence of comorbidities among patients with OA is high, indicating multiple diseases share a similar genetic susceptibility to OA. Methods: In this study, we performed GWAS of OA and OA-associated key comorbidities on data from 3366 OA patients obtained from the Osteoarthritis Initiative (OAI). We performed Mendelian randomization to identify the possible causal relationship between OA and OA-related clinical features. Results: One significant OA-associated locus, rs2305570, was identified through sex-specific genome-wide association. By calculating the LD score, we found OA is positively correlated with heart disease and stroke. A strong genetic correlation was observed between knee OA and inflammatory diseases, including eczema, multiple sclerosis, and Crohn's disease. Our study also found that knee alignment is one of the major risk factors in OA development, and we surprisingly found knee pain is not a causative factor of OA, although it is the most common symptom of OA. Conclusion: We investigated several significant positive/negative genetic correlations between OA and common chronic diseases, suggesting substantial genetic overlaps between OA and these traits. The sex-specific association analysis supports the critical role of chromosome X in OA development in females.
