Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 53
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Hum Mol Genet ; 33(13): 1131-1141, 2024 Jun 21.
Artigo em Inglês | MEDLINE | ID: mdl-38538560

RESUMO

Splicing factors (SFs) are the major RNA-binding proteins (RBPs) and key molecules that regulate the splicing of mRNA molecules through binding to mRNAs. The expression of splicing factors is frequently deregulated in different cancer types, causing the generation of oncogenic proteins involved in cancer hallmarks. In this study, we investigated the genes that encode RNA-binding proteins and identified potential splicing factors that contribute to the aberrant splicing applying a random forest classification model. The result suggested 56 splicing factors were related to the prognosis of 13 cancers, two SF complexes in liver hepatocellular carcinoma, and one SF complex in esophageal carcinoma. Further systematic bioinformatics studies on these cancer prognostic splicing factors and their related alternative splicing events revealed the potential regulations in a cancer-specific manner. Our analysis found high ILF2-ILF3 expression correlates with poor prognosis in LIHC through alternative splicing. These findings emphasize the importance of SFs as potential indicators for prognosis or targets for therapeutic interventions. Their roles in cancer exhibit complexity and are contingent upon the specific context in which they operate. This recognition further underscores the need for a comprehensive understanding and exploration of the role of SFs in different types of cancer, paving the way for their potential utilization in prognostic assessments and the development of targeted therapies.


Assuntos
Processamento Alternativo , Biologia Computacional , Regulação Neoplásica da Expressão Gênica , Aprendizado de Máquina , Neoplasias , Fatores de Processamento de RNA , Humanos , Fatores de Processamento de RNA/genética , Fatores de Processamento de RNA/metabolismo , Prognóstico , Processamento Alternativo/genética , Neoplasias/genética , Biologia Computacional/métodos , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Splicing de RNA/genética , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Neoplasias Hepáticas/genética , Carcinoma Hepatocelular/genética
2.
Brief Bioinform ; 25(2)2024 Jan 22.
Artigo em Inglês | MEDLINE | ID: mdl-38493341

RESUMO

Kinase fusion genes are the most active fusion gene group in human cancer fusion genes. To help choose the clinically significant kinase so that the cancer patients that have fusion genes can be better diagnosed, we need a metric to infer the assessment of kinases in pan-cancer fusion genes rather than relying on the sample frequency expressed fusion genes. Most of all, multiple studies assessed human kinases as the drug targets using multiple types of genomic and clinical information, but none used the kinase fusion genes in their study. The assessment studies of kinase without kinase fusion gene events can miss the effect of one of the mechanisms that enhance the kinase function in cancer. To fill this gap, in this study, we suggest a novel way of assessing genes using a network propagation approach to infer how likely individual kinases influence the kinase fusion gene network composed of ~5K kinase fusion gene pairs. To select a better seed of propagation, we chose the top genes via dimensionality reduction like a principal component or latent layer information of six features of individual genes in pan-cancer fusion genes. Our approach may provide a novel way to assess of human kinases in cancer.


Assuntos
Redes Reguladoras de Genes , Neoplasias , Humanos , Neoplasias/genética , Fusão Gênica
3.
Nucleic Acids Res ; 52(D1): D1289-D1304, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37870473

RESUMO

Tumorigenic functions due to the formation of fusion genes have been targeted for cancer therapeutics (i.e. kinase inhibitors). However, many fusion proteins involved in various cellular processes have not been studied for targeted therapeutics. This is because the lack of complete fusion protein sequences and their whole 3D structures has made it challenging to develop new therapeutic strategies. To fill these critical gaps, we developed a computational pipeline and a resource of human fusion proteins named FusionPDB, available at https://compbio.uth.edu/FusionPDB. FusionPDB is organized into four levels: 43K fusion protein sequences (14.7K in-frame fusion genes, Level 1), over 2300 + 1267 fusion protein 3D structures (from 2300 recurrent and 266 manually curated in-frame fusion genes, Level 2), pLDDT score analysis for the 1267 fusion proteins from 266 manually curated fusion genes (Level 3), and virtual screening outcomes for 68 selected fusion proteins from 266 manually curated fusion genes (Level 4). FusionPDB is the only resource providing whole 3D structures of fusion proteins and comprehensive knowledge of human fusion proteins. It will be regularly updated until it covers all human fusion proteins in the future.


Assuntos
Bases de Dados de Proteínas , Humanos , Sequência de Aminoácidos , Bases de Conhecimento , Neoplasias/genética , Conformação Proteica
4.
Nucleic Acids Res ; 52(D1): D1276-D1288, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37870454

RESUMO

Among the diverse sources of neoantigens (i.e. single-nucleotide variants (SNVs), insertions or deletions (Indels) and fusion genes), fusion gene-derived neoantigens are generally more immunogenic, have multiple targets per mutation and are more widely distributed across various cancer types. Therefore, fusion gene-derived neoantigens are a potential source of highly immunogenic neoantigens and hold great promise for cancer immunotherapy. However, the lack of fusion protein sequence resources and knowledge prevents this application. We introduce 'FusionNeoAntigen', a dedicated resource for fusion-specific neoantigens, accessible at https://compbio.uth.edu/FusionNeoAntigen. In this resource, we provide fusion gene breakpoint crossing neoantigens focused on ∼43K fusion proteins of ∼16K in-frame fusion genes from FusionGDB2.0. FusionNeoAntigen provides fusion gene information, corresponding fusion protein sequences, fusion breakpoint peptide sequences, fusion gene-derived neoantigen prediction, virtual screening between fusion breakpoint peptides having potential fusion neoantigens and human leucocyte antigens (HLAs), fusion breakpoint RNA/protein sequences for developing vaccines, information on samples with fusion-specific neoantigen, potential CAR-T targetable cell-surface fusion proteins and literature curation. FusionNeoAntigen will help to develop fusion gene-based immunotherapies. We will report all potential fusion-specific neoantigens from all possible open reading frames of ∼120K human fusion genes in future versions.


Assuntos
Antígenos de Neoplasias , Bases de Dados Genéticas , Neoplasias , Proteínas de Fusão Oncogênica , Humanos , Antígenos de Neoplasias/genética , Antígenos HLA , Mutação INDEL , Mutação , Neoplasias/genética , Proteínas de Fusão Oncogênica/genética
5.
Nucleic Acids Res ; 52(D1): D701-D713, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37897356

RESUMO

The COVID-19 pandemic, caused by the coronavirus SARS-CoV-2, has resulted in the loss of millions of lives and severe global economic consequences. Every time SARS-CoV-2 replicates, the viruses acquire new mutations in their genomes. Mutations in SARS-CoV-2 genomes led to increased transmissibility, severe disease outcomes, evasion of the immune response, changes in clinical manifestations and reducing the efficacy of vaccines or treatments. To date, the multiple resources provide lists of detected mutations without key functional annotations. There is a lack of research examining the relationship between mutations and various factors such as disease severity, pathogenicity, patient age, patient gender, cross-species transmission, viral immune escape, immune response level, viral transmission capability, viral evolution, host adaptability, viral protein structure, viral protein function, viral protein stability and concurrent mutations. Deep understanding the relationship between mutation sites and these factors is crucial for advancing our knowledge of SARS-CoV-2 and for developing effective responses. To fill this gap, we built COV2Var, a function annotation database of SARS-CoV-2 genetic variation, available at http://biomedbdc.wchscu.cn/COV2Var/. COV2Var aims to identify common mutations in SARS-CoV-2 variants and assess their effects, providing a valuable resource for intensive functional annotations of common mutations among SARS-CoV-2 variants.


Assuntos
Bases de Dados Genéticas , SARS-CoV-2 , Humanos , Mutação , SARS-CoV-2/genética , Anotação de Sequência Molecular , Variação Genética
6.
Nucleic Acids Res ; 52(D1): D1042-D1052, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37953308

RESUMO

StemDriver is a comprehensive knowledgebase dedicated to the functional annotation of genes participating in the determination of hematopoietic stem cell fate, available at http://biomedbdc.wchscu.cn/StemDriver/. By utilizing single-cell RNA sequencing data, StemDriver has successfully assembled a comprehensive lineage map of hematopoiesis, capturing the entire continuum from the initial formation of hematopoietic stem cells to the fully developed mature cells. Extensive exploration and characterization were conducted on gene expression features corresponding to each lineage commitment. At the current version, StemDriver integrates data from 42 studies, encompassing a diverse range of 14 tissue types spanning from the embryonic phase to adulthood. In order to ensure uniformity and reliability, all data undergo a standardized pipeline, which includes quality data pre-processing, cell type annotation, differential gene expression analysis, identification of gene categories correlated with differentiation, analysis of highly variable genes along pseudo-time, and exploration of gene expression regulatory networks. In total, StemDriver assessed the function of 23 839 genes for human samples and 29 533 genes for mouse samples. Simultaneously, StemDriver also provided users with reference datasets and models for cell annotation. We believe that StemDriver will offer valuable assistance to research focused on cellular development and hematopoiesis.


Assuntos
Hematopoese , Células-Tronco Hematopoéticas , Animais , Humanos , Camundongos , Redes Reguladoras de Genes , Hematopoese/genética , Células-Tronco Hematopoéticas/metabolismo , Reprodutibilidade dos Testes , Bases de Conhecimento , Linhagem da Célula
7.
Nucleic Acids Res ; 52(D1): D822-D834, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37850649

RESUMO

Aging entails gradual functional decline influenced by interconnected factors. Multiple hallmarks proposed as common and conserved underlying denominators of aging on the molecular, cellular and systemic levels across multiple species. Thus, understanding the function of aging hallmarks and their relationships across species can facilitate the translation of anti-aging drug development from model organisms to humans. Here, we built AgeAnnoMO (https://relab.xidian.edu.cn/AgeAnnoMO/#/), a knowledgebase of multi-omics annotation for animal aging. AgeAnnoMO encompasses an extensive collection of 136 datasets from eight modalities, encompassing 8596 samples from 50 representative species, making it a comprehensive resource for aging and longevity research. AgeAnnoMO characterizes multiple aging regulators across species via multi-omics data, comprehensively annotating aging-related genes, proteins, metabolites, mitochondrial genes, microbiotas and age-specific TCR and BCR sequences tied to aging hallmarks for these species and tissues. AgeAnnoMO not only facilitates a deeper and more generalizable understanding of aging mechanisms, but also provides potential insights of the specificity across tissues and species in aging process, which is important to develop the effective anti-aging interventions for diverse populations. We anticipate that AgeAnnoMO will provide a valuable resource for comprehending and integrating the conserved driving hallmarks in aging biology and identifying the targetable biomarkers for aging research.


Assuntos
Envelhecimento , Bases de Conhecimento , Multiômica , Animais , Humanos , Envelhecimento/genética , Biomarcadores , Longevidade/genética
8.
Nucleic Acids Res ; 52(D1): D1253-D1264, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37986230

RESUMO

Drug resistance poses a significant challenge in cancer treatment. Despite the initial effectiveness of therapies such as chemotherapy, targeted therapy and immunotherapy, many patients eventually develop resistance. To gain deep insights into the underlying mechanisms, single-cell profiling has been performed to interrogate drug resistance at cell level. Herein, we have built the DRMref database (https://ccsm.uth.edu/DRMref/) to provide comprehensive characterization of drug resistance using single-cell data from drug treatment settings. The current version of DRMref includes 42 single-cell datasets from 30 studies, covering 382 samples, 13 major cancer types, 26 cancer subtypes, 35 treatment regimens and 42 drugs. All datasets in DRMref are browsable and searchable, with detailed annotations provided. Meanwhile, DRMref includes analyses of cellular composition, intratumoral heterogeneity, epithelial-mesenchymal transition, cell-cell interaction and differentially expressed genes in resistant cells. Notably, DRMref investigates the drug resistance mechanisms (e.g. Aberration of Drug's Therapeutic Target, Drug Inactivation by Structure Modification, etc.) in resistant cells. Additional enrichment analysis of hallmark/KEGG (Kyoto Encyclopedia of Genes and Genomes)/GO (Gene Ontology) pathways, as well as the identification of microRNA, motif and transcription factors involved in resistant cells, is provided in DRMref for user's exploration. Overall, DRMref serves as a unique single-cell-based resource for studying drug resistance, drug combination therapy and discovering novel drug targets.


Assuntos
Bases de Dados Factuais , Resistência a Medicamentos , MicroRNAs , Neoplasias , Humanos , Resistência a Medicamentos/genética , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , MicroRNAs/genética , Neoplasias/tratamento farmacológico , Neoplasias/genética , Internet
9.
Brief Bioinform ; 24(5)2023 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-37635381

RESUMO

Microhomology-mediated end joining (MMEJ), an error-prone DNA damage repair mechanism, frequently leads to chromosomal rearrangements due to its ability to engage in promiscuous end joining of genomic instability and also leads to increasing mutational load at the sequences flanking the breakpoints (BPs). In this study, we systematically investigated the homology sequences around the genomic breakpoint area of human fusion genes, which were formed by the chromosomal rearrangements initiated by DNA double-strand breakage. Since the RNA-seq data is the typical data set to check the fusion genes, for the known exon junction fusion breakpoints identified from RNA-seq data, we have to infer the high chance of genomic breakpoint regions. For this, we utilized the high feature importance score area calculated from our recently developed fusion BP prediction model, FusionAI and identified 151 K microhomologies among ~24 K fusion BPs in 20 K fusion genes. From our multiple bioinformatics studies, we found a relationship between sequence homologies and the immune system. This in-silico study will provide novel knowledge on the sequence homologies around the coded structural variants.


Assuntos
Biologia Computacional , Neoplasias , Humanos , Genômica , Neoplasias/genética , Éxons , Instabilidade Genômica
10.
Brief Bioinform ; 24(2)2023 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-36681936

RESUMO

A-to-I RNA editing diversifies human transcriptome to confer its functional effects on the downstream genes or regulations, potentially involving in neurodegenerative pathogenesis. Its variabilities are attributed to multiple regulators, including the key factor of genetic variants. To comprehensively investigate the potentials of neurodegenerative disease-susceptibility variants from the view of A-to-I RNA editing, we analyzed matched genetic and transcriptomic data of 1596 samples across nine brain tissues and whole blood from two large consortiums, Accelerating Medicines Partnership-Alzheimer's Disease and Parkinson's Progression Markers Initiative. The large-scale and genome-wide identification of 95 198 RNA editing quantitative trait loci revealed the preferred genetic effects on adjacent editing events. Furthermore, to explore the underlying mechanisms of the genetic controls of A-to-I RNA editing, several top RNA-binding proteins were pointed out, such as EIF4A3, U2AF2, NOP58, FBL, NOP56 and DHX9, since their regulations on multiple RNA-editing events were probably interfered by these genetic variants. Moreover, these variants may also contribute to the variability of other molecular phenotypes associated with RNA editing, including the functions of 3 proteins, expressions of 277 genes and splicing of 449 events. All the analyses results shown in NeuroEdQTL (https://relab.xidian.edu.cn/NeuroEdQTL/) constituted a unique resource for the understanding of neurodegenerative pathogenesis from genotypes to phenotypes related to A-to-I RNA editing.


Assuntos
Doenças Neurodegenerativas , Humanos , Doenças Neurodegenerativas/genética , Edição de RNA , Transcriptoma , Perfilação da Expressão Gênica , Locos de Características Quantitativas , Fator de Iniciação 4A em Eucariotos/genética , RNA Helicases DEAD-box/genética
11.
Nucleic Acids Res ; 51(D1): D1138-D1149, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36243975

RESUMO

In recent years, the explosive growth of spatial technologies has enabled the characterization of spatial heterogeneity of tissue architectures. Compared to traditional sequencing, spatial transcriptomics reserves the spatial information of each captured location and provides novel insights into diverse spatially related biological contexts. Even though two spatial transcriptomics databases exist, they provide limited analytical information. Information such as spatial heterogeneity of genes and cells, cell-cell communication activities in space, and the cell type compositions in the microenvironment are critical clues to unveil the mechanism of tumorigenesis and embryo differentiation. Therefore, we constructed a new spatial transcriptomics database, named SPASCER (https://ccsm.uth.edu/SPASCER), designed to help understand the heterogeneity of tissue organizations, region-specific microenvironment, and intercellular interactions across tissue architectures at multiple levels. SPASCER contains datasets from 43 studies, including 1082 sub-datasets from 16 organ types across four species. scRNA-seq was integrated to deconvolve/map spatial transcriptomics, and processed with spatial cell-cell interaction, gene pattern and pathway enrichment analysis. Cell-cell interactions and gene regulation network of scRNA-seq from matched spatial transcriptomics were performed as well. The application of SPASCER will provide new insights into tissue architecture and a solid foundation for the mechanistic understanding of many biological processes in healthy and diseased tissues.


Assuntos
Bases de Dados Genéticas , Perfilação da Expressão Gênica , Humanos , Carcinogênese , Comunicação Celular , Diferenciação Celular , Análise de Célula Única , Transcriptoma , Microambiente Tumoral
12.
Nucleic Acids Res ; 51(D1): D805-D815, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36200838

RESUMO

Aging is a complex process that accompanied by molecular and cellular alterations. The identification of tissue-/cell type-specific biomarkers of aging and elucidation of the detailed biological mechanisms of aging-related genes at the single-cell level can help to understand the heterogeneous aging process and design targeted anti-aging therapeutics. Here, we built AgeAnno (https://relab.xidian.edu.cn/AgeAnno/#/), a knowledgebase of single cell annotation of aging in human, aiming to provide comprehensive characterizations for aging-related genes across diverse tissue-cell types in human by using single-cell RNA and ATAC sequencing data (scRNA and scATAC). The current version of AgeAnno houses 1 678 610 cells from 28 healthy tissue samples with ages ranging from 0 to 110 years. We collected 5580 aging-related genes from previous resources and performed dynamic functional annotations of the cellular context. For the scRNA data, we performed analyses include differential gene expression, gene variation coefficient, cell communication network, transcription factor (TF) regulatory network, and immune cell proportionc. AgeAnno also provides differential chromatin accessibility analysis, motif/TF enrichment and footprint analysis, and co-accessibility peak analysis for scATAC data. AgeAnno will be a unique resource to systematically characterize aging-related genes across diverse tissue-cell types in human, and it could facilitate antiaging and aging-related disease research.


Assuntos
Envelhecimento , Bases de Conhecimento , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Criança , Pré-Escolar , Humanos , Lactente , Recém-Nascido , Pessoa de Meia-Idade , Adulto Jovem , Cromatina/genética , Análise de Célula Única , Envelhecimento/genética , Envelhecimento/patologia
13.
Nucleic Acids Res ; 50(D1): D1295-D1306, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34791419

RESUMO

The long non-coding RNAs associating with other molecules can coordinate several physiological processes and their dysfunction can impact diverse human diseases. To date, systematic and intensive annotations on diverse interaction regulations of lncRNAs in human cancer were not available. Here, we built lncRNAfunc, a knowledgebase of lncRNA function in human cancer at https://ccsm.uth.edu/lncRNAfunc, aiming to provide a resource and reference for providing therapeutically targetable lncRNAs and intensive interaction regulations. To do this, we collected 15 900 lncRNAs across 33 cancer types from TCGA. For individual lncRNAs, we performed multiple interaction analyses of different biomolecules including DNA, RNA, and protein levels. Our intensive studies of lncRNAs provide diverse potential mechanisms of lncRNAs that regulate gene expression through binding enhancers and 3'-UTRs of genes, competing for miRNA binding sites with mRNAs, recruiting the transcription factors to gene promoters. Furthermore, we investigated lncRNAs that potentially affect the alternative splicing events through interacting with RNA binding Proteins. We also performed multiple functional annotations including cancer stage-associated lncRNAs, RNA A-to-I editing event-associated lncRNAs, and lncRNA expression quantitative trait loci. lncRNAfunc is a unique resource for cancer research communities to help better understand potential lncRNA regulations and therapeutic lncRNA targets.


Assuntos
Bases de Dados Genéticas , Bases de Conhecimento , Neoplasias/genética , RNA Longo não Codificante/genética , Processamento Alternativo/genética , Humanos , Neoplasias/classificação , RNA Longo não Codificante/classificação , RNA Mensageiro/genética
14.
Nucleic Acids Res ; 50(D1): D1221-D1230, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34755868

RESUMO

A knowledgebase of the systematic functional annotation of fusion genes is critical for understanding genomic breakage context and developing therapeutic strategies. FusionGDB is a unique functional annotation database of human fusion genes and has been widely used for studies with diverse aims. In this study, we report fusion gene annotation updates aided by deep learning (FusionGDB 2.0) available at https://compbio.uth.edu/FusionGDB2/. FusionGDB 2.0 has substantial updates of contents such as up-to-date human fusion genes, fusion gene breakage tendency score with FusionAI deep learning model based on 20 kb DNA sequence around BP, investigation of overlapping between fusion breakpoints with 44 human genomic features across five cellular role's categories, transcribed chimeric sequence and following open reading frame analysis with coding potential based on deep learning approach with Ribo-seq read features, and rigorous investigation of the protein feature retention of individual fusion partner genes in the protein level. Among ∼102k fusion genes, about 15k kept their ORF as In-frames, which is two times compared to the previous version, FusionGDB. FusionGDB 2.0 will be used as the reference knowledgebase of fusion gene annotations. FusionGDB 2.0 provides eight categories of annotations and it will be helpful for diverse human genomic studies.


Assuntos
Bases de Dados Genéticas , Fusão Gênica/genética , Genoma Humano/genética , Genômica , Sequência de Aminoácidos/genética , Aprendizado Profundo , Humanos , Bases de Conhecimento , Anotação de Sequência Molecular
15.
Brief Bioinform ; 22(5)2021 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-33401309

RESUMO

A-to-I RNA editing, contributing to nearly 90% of all editing events in human, has been reported to involve in the pathogenesis of Alzheimer's disease (AD) due to its roles in brain development and immune regulation, such as the deficient editing of GluA2 Q/R related to cell death and memory loss. Currently, there are urgent needs for the systematic annotations of A-to-I RNA editing events in AD. Here, we built ADeditome, the annotation database of A-to-I RNA editing in AD available at https://ccsm.uth.edu/ADeditome, aiming to provide a resource and reference for functional annotation of A-to-I RNA editing in AD to identify therapeutically targetable genes in an individual. We detected 1676 363 editing sites in 1524 samples across nine brain regions from ROSMAP, MayoRNAseq and MSBB. For these editing events, we performed multiple functional annotations including identification of specific and disease stage associated editing events and the influence of editing events on gene expression, protein recoding, alternative splicing and miRNA regulation for all the genes, especially for AD-related genes in order to explore the pathology of AD. Combing all the analysis results, we found 108 010 and 26 168 editing events which may promote or inhibit AD progression, respectively. We also found 5582 brain region-specific editing events with potentially dual roles in AD across different brain regions. ADeditome will be a unique resource for AD and drug research communities to identify therapeutically targetable editing events. Significance: ADeditome is the first comprehensive resource of the functional genomics of individual A-to-I RNA editing events in AD, which will be useful for many researchers in the fields of AD pathology, precision medicine, and therapeutic researches.


Assuntos
Adenosina/metabolismo , Doença de Alzheimer/genética , Amnésia/genética , Inosina/metabolismo , Proteínas do Tecido Nervoso/genética , Edição de RNA , Transcriptoma , Adenosina/genética , Processamento Alternativo , Doença de Alzheimer/metabolismo , Doença de Alzheimer/patologia , Amnésia/metabolismo , Amnésia/patologia , Encéfalo/metabolismo , Encéfalo/patologia , Mapeamento Encefálico , Bases de Dados Genéticas , Ontologia Genética , Humanos , Inosina/genética , MicroRNAs/classificação , MicroRNAs/genética , MicroRNAs/metabolismo , Anotação de Sequência Molecular , Proteínas do Tecido Nervoso/classificação , Proteínas do Tecido Nervoso/metabolismo , Receptores de AMPA/genética , Receptores de AMPA/metabolismo
16.
Brief Bioinform ; 22(3)2021 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-32436932

RESUMO

It has been increasingly accepted that microRNA (miRNA) can both activate and suppress gene expression, directly or indirectly, under particular circumstances. Yet, a systematic study on the switch in their interaction pattern between activation and suppression and between normal and cancer conditions based on multi-omics evidences is not available. We built miRactDB, a database for miRNA-gene interaction, at https://ccsm.uth.edu/miRactDB, to provide a versatile resource and platform for annotation and interpretation of miRNA-gene relations. We conducted a comprehensive investigation on miRNA-gene interactions and their biological implications across tissue types in both tumour and normal conditions, based on TCGA, CCLE and GTEx databases. We particularly explored the genetic and epigenetic mechanisms potentially contributing to the positive correlation, including identification of miRNA binding sites in the gene coding sequence (CDS) and promoter regions of partner genes. Integrative analysis based on this resource revealed that top-ranked genes derived from TCGA tumour and adjacent normal samples share an overwhelming part of biological processes, which are quite different than those from CCLE and GTEx. The most active miRNAs predicted to target CDS and promoter regions are largely overlapped. These findings corroborate that adjacent normal tissues might have undergone significant molecular transformations towards oncogenesis before phenotypic and histological change; and there probably exists a small yet critical set of miRNAs that profoundly influence various cancer hallmark processes. miRactDB provides a unique resource for the cancer and genomics communities to screen, prioritize and rationalize their candidates of miRNA-gene interactions, in both normal and cancer scenarios.


Assuntos
Regulação Neoplásica da Expressão Gênica , MicroRNAs/genética , Neoplasias/genética , Sítios de Ligação , Carcinogênese/genética , Epigênese Genética , Humanos , Regiões Promotoras Genéticas
17.
Brief Bioinform ; 22(5)2021 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-33497435

RESUMO

Exon skipping (ES), the most common alternative splicing event, has been reported to contribute to diverse human diseases due to the loss of functional domains/sites or frameshifting of the open reading frame (ORF) and noticed as therapeutic targets. Accumulating transcriptomic studies of aging brains show the splicing disruption is a widespread hallmark of neurodegenerative diseases such as Alzheimer's disease (AD). Here, we built ExonSkipAD, the ES annotation database aiming to provide a resource/reference for functional annotation of ES events in AD and identify therapeutic targets in exon units. We identified 16 414 genes that have ~156 K, ~ 69 K, ~ 231 K ES events from the three representative AD cohorts of ROSMAP, MSBB and Mayo, respectively. For these ES events, we performed multiple functional annotations relating to ES mechanisms or downstream. Specifically, through the functional feature retention studies followed by the open reading frames (ORFs), we identified 275 important cellular regulators that might lose their cellular regulator roles due to exon skipping in AD. ExonSkipAD provides twelve categories of annotations: gene summary, gene structures and expression levels, exon skipping events with PSIs, ORF annotation, exon skipping events in the canonical protein sequence, 3'-UTR located exon skipping events lost miRNA-binding sites, SNversus in the skipped exons with a depth of coverage, AD stage-associated exon skipping events, splicing quantitative trait loci (sQTLs) in the skipped exons, correlation with RNA-binding proteins, and related drugs & diseases. ExonSkipAD will be a unique resource of transcriptomic diversity research for understanding the mechanisms of neurodegenerative disease development and identifying potential therapeutic targets in AD. Significance AS the first comprehensive resource of the functional genomics of the alternative splicing events in AD, ExonSkipAD will be useful for many researchers in the fields of pathology, AD genomics and precision medicine, and pharmaceutical and therapeutic researches.


Assuntos
Regiões 3' não Traduzidas , Processamento Alternativo , Doença de Alzheimer , Bases de Dados de Ácidos Nucleicos , Éxons , Transcriptoma , Doença de Alzheimer/genética , Doença de Alzheimer/metabolismo , Humanos , MicroRNAs , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo
18.
Brief Bioinform ; 22(3)2021 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-32510566

RESUMO

More than 48 kinase inhibitors (KIs) have been approved by Food and Drug Administration. However, drug-resistance (DR) eventually occurs, and secondary mutations have been found in the previously targeted primary-mutated cancer cells. Cancer and drug research communities recognize the importance of the kinase domain (KD) mutations for kinasopathies. So far, a systematic investigation of kinase mutations on DR hotspots has not been done yet. In this study, we systematically investigated four types of representative mutation hotspots (gatekeeper, G-loop, αC-helix and A-loop) associated with DR in 538 human protein kinases using large-scale cancer data sets (TCGA, ICGC, COSMIC and GDSC). Our results revealed 358 kinases harboring 3318 mutations that covered 702 drug resistance hotspot residues. Among them, 197 kinases had multiple genetic variants on each residue. We further computationally assessed and validated the epidermal growth factor receptor mutations on protein structure and drug-binding efficacy. This is the first study to provide a landscape view of DR-associated mutation hotspots in kinase's secondary structures, and its knowledge will help the development of effective next-generation KIs for better precision medicine.


Assuntos
Bases de Dados de Proteínas , Resistencia a Medicamentos Antineoplásicos/genética , Proteínas de Neoplasias/genética , Neoplasias/genética , Receptores ErbB/genética , Humanos , Neoplasias/tratamento farmacológico , Neoplasias/enzimologia , Estrutura Secundária de Proteína
19.
Nucleic Acids Res ; 48(W1): W313-W320, 2020 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-32421816

RESUMO

Among the diverse location of the breakpoints (BPs) of structural variants (SVs), the breakpoints of fusion genes (FGs) are located in the gene bodies. This broken gene context provided the aberrant functional clues to study disease genesis. Many tumorigenic fusion genes have retained or lost functional or regulatory domains and these features impacted tumorigenesis. Full annotation of fusion genes aided by the visualization tool based on two gene bodies will be helpful to study the functional aspect of fusion genes. To date, a specialized tool with effective visualization of the functional features of fusion genes is not available. In this study, we built FGviewer, a tool for visualizing functional features of human fusion genes, which is available at https://ccsmweb.uth.edu/FGviewer. FGviewer gets the input of fusion gene symbols, breakpoint information, or structural variants from whole-genome sequence (WGS) data. For any combination of gene pairs/breakpoints to be involved in fusion genes, the users can search the functional/regulatory aspect of the fusion gene in the three bio-molecular levels (DNA-, RNA-, and protein-levels) and one clinical level (pathogenic-level). FGviewer will be a unique online tool in disease research communities.


Assuntos
Fusão Gênica , Software , Sequência de Aminoácidos , Gráficos por Computador , Doença/genética , Humanos , Fatores de Transcrição/metabolismo , Sequenciamento Completo do Genoma
20.
Nucleic Acids Res ; 48(D1): D896-D907, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31642488

RESUMO

Exon skipping (ES) is reported to be the most common alternative splicing event due to loss of functional domains/sites or shifting of the open reading frame (ORF), leading to a variety of human diseases and considered therapeutic targets. To date, systematic and intensive annotations of ES events based on the skipped exon units in cancer and normal tissues are not available. Here, we built ExonSkipDB, the ES annotation database available at https://ccsm.uth.edu/ExonSkipDB/, aiming to provide a resource and reference for functional annotation of ES events in multiple cancer and tissues to identify therapeutically targetable genes in individual exon units. We collected 14 272 genes that have 90 616 and 89 845 ES events across 33 cancer types and 31 normal tissues from The Cancer Genome Atlas (TCGA) and Genotype-Tissue Expression (GTEx). For the ES events, we performed multiple functional annotations. These include ORF assignment of exon skipped transcript, studies of lost protein functional features due to ES events, and studies of exon skipping events associated with mutations and methylations based on multi-omics evidence. ExonSkipDB will be a unique resource for cancer and drug research communities to identify therapeutically targetable exon skipping events.


Assuntos
Processamento Alternativo/genética , Bases de Dados Genéticas , Éxons/genética , Genoma Humano/genética , Mutação/genética , Neoplasias/genética , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA