Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 35
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36515158

RESUMO

The development of targeted drugs allows precision medicine in cancer treatment and optimal targeted therapies. Accurate identification of cancer druggable genes helps strengthen the understanding of targeted cancer therapy and promotes precise cancer treatment. However, rare cancer-druggable genes have been found due to the multi-omics data's diversity and complexity. This study proposes deep forest for cancer druggable genes discovery (DF-CAGE), a novel machine learning-based method for cancer-druggable gene discovery. DF-CAGE integrated the somatic mutations, copy number variants, DNA methylation and RNA-Seq data across ˜10 000 TCGA profiles to identify the landscape of the cancer-druggable genes. We found that DF-CAGE discovers the commonalities of currently known cancer-druggable genes from the perspective of multi-omics data and achieved excellent performance on OncoKB, Target and Drugbank data sets. Among the ˜20 000 protein-coding genes, DF-CAGE pinpointed 465 potential cancer-druggable genes. We found that the candidate cancer druggable genes (CDG) are clinically meaningful and divided the CDG into known, reliable and potential gene sets. Finally, we analyzed the omics data's contribution to identifying druggable genes. We found that DF-CAGE reports druggable genes mainly based on the copy number variations (CNVs) data, the gene rearrangements and the mutation rates in the population. These findings may enlighten the future study and development of new drugs.


Assuntos
Genômica , Neoplasias , Humanos , Genômica/métodos , Multiômica , Variações do Número de Cópias de DNA , Neoplasias/tratamento farmacológico , Neoplasias/genética , Aprendizado de Máquina , Estudos de Associação Genética
2.
J Transl Med ; 22(1): 302, 2024 Mar 24.
Artigo em Inglês | MEDLINE | ID: mdl-38521921

RESUMO

BACKGROUND: Myasthenia gravis (MG) is a chronic autoimmune disorder characterized by fluctuating muscle weakness. Despite the availability of established therapies, the management of MG symptoms remains suboptimal, partially attributed to lack of efficacy or intolerable side-effects. Therefore, new effective drugs are warranted for treatment of MG. METHODS: By employing an analytical framework that combines Mendelian randomization (MR) and colocalization analysis, we estimate the causal effects of blood druggable expression quantitative trait loci (eQTLs) and protein quantitative trait loci (pQTLs) on the susceptibility of MG. We subsequently investigated whether potential genetic effects exhibit cell-type specificity by utilizing genetic colocalization analysis to assess the interplay between immune-cell-specific eQTLs and MG risk. RESULTS: We identified significant MR results for four genes (CDC42BPB, CD226, PRSS36, and TNFSF12) using cis-eQTL genetic instruments and three proteins (CTSH, PRSS8, and CPN2) using cis-pQTL genetic instruments. Six of these loci demonstrated evidence of colocalization with MG susceptibility (posterior probability > 0.80). We next undertook genetic colocalization to investigate cell-type-specific effects at these loci. Notably, we identified robust evidence of colocalization, with a posterior probability of 0.854, linking CTSH expression in TH2 cells and MG risk. CONCLUSIONS: This study provides crucial insights into the genetic and molecular factors associated with MG susceptibility, singling out CTSH as a potential candidate for in-depth investigation and clinical consideration. It additionally sheds light on the immune-cell regulatory mechanisms related to the disease. However, further research is imperative to validate these targets and evaluate their feasibility for drug development.


Assuntos
Predisposição Genética para Doença , Miastenia Gravis , Humanos , Multiômica , Estudo de Associação Genômica Ampla , Miastenia Gravis/genética , Locos de Características Quantitativas/genética , Polimorfismo de Nucleotídeo Único/genética
3.
Hum Genomics ; 16(1): 8, 2022 03 04.
Artigo em Inglês | MEDLINE | ID: mdl-35246263

RESUMO

Coronary artery disease (CAD) is a multifactorial disorder, which is partly heritable. Herein, we implemented a mapping of CAD-associated candidate genes by using genome-wide enhancer-promoter conformation (H3K27ac-HiChIP) and expression quantitative trait loci (eQTL). Enhancer-promoter anchor loops from human coronary artery smooth muscle cells (HCASMC) explained 22% of the heritability for CAD. 3D enhancer-promoter genome mapping of CAD-genes in HCASMC was enriched in vascular eQTL genes. By using colocalization and Mendelian randomization analyses, we identified 58 causal candidate vascular genes including some druggable targets (MAP3K11, CAMK1D, PDGFD, IPO9 and CETP). A network analysis of causal candidate genes was enriched in TGF beta and MAPK pathways. The pharmacologic inhibition of causal candidate gene MAP3K11 in vascular SMC reduced the expression of athero-relevant genes and lowered cell migration, a cardinal process in CAD. Genes connected to enhancers are enriched in vascular eQTL and druggable genes causally associated with CAD.


Assuntos
Doença da Artéria Coronariana , Doença da Artéria Coronariana/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Análise da Randomização Mendeliana , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética
4.
BMC Bioinformatics ; 23(1): 37, 2022 Jan 12.
Artigo em Inglês | MEDLINE | ID: mdl-35021991

RESUMO

BACKGROUND: LINCS, "Library of Integrated Network-based Cellular Signatures", and IDG, "Illuminating the Druggable Genome", are both NIH projects and consortia that have generated rich datasets for the study of the molecular basis of human health and disease. LINCS L1000 expression signatures provide unbiased systems/omics experimental evidence. IDG provides compiled and curated knowledge for illumination and prioritization of novel drug target hypotheses. Together, these resources can support a powerful new approach to identifying novel drug targets for complex diseases, such as Parkinson's disease (PD), which continues to inflict severe harm on human health, and resist traditional research approaches. RESULTS: Integrating LINCS and IDG, we built the Knowledge Graph Analytics Platform (KGAP) to support an important use case: identification and prioritization of drug target hypotheses for associated diseases. The KGAP approach includes strong semantics interpretable by domain scientists and a robust, high performance implementation of a graph database and related analytical methods. Illustrating the value of our approach, we investigated results from queries relevant to PD. Approved PD drug indications from IDG's resource DrugCentral were used as starting points for evidence paths exploring chemogenomic space via LINCS expression signatures for associated genes, evaluated as target hypotheses by integration with IDG. The KG-analytic scoring function was validated against a gold standard dataset of genes associated with PD as elucidated, published mechanism-of-action drug targets, also from DrugCentral. IDG's resource TIN-X was used to rank and filter KGAP results for novel PD targets, and one, SYNGR3 (Synaptogyrin-3), was manually investigated further as a case study and plausible new drug target for PD. CONCLUSIONS: The synergy of LINCS and IDG, via KG methods, empowers graph analytics methods for the investigation of the molecular basis of complex diseases, and specifically for identification and prioritization of novel drug targets. The KGAP approach enables downstream applications via integration with resources similarly aligned with modern KG methodology. The generality of the approach indicates that KGAP is applicable to many disease areas, in addition to PD, the focus of this paper.


Assuntos
Doença de Parkinson , Biblioteca Gênica , Genoma , Humanos , Iluminação , Doença de Parkinson/tratamento farmacológico , Doença de Parkinson/genética , Reconhecimento Automatizado de Padrão
5.
BMC Genomics ; 23(1): 588, 2022 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-35964012

RESUMO

BACKGROUND: Heart failure (HF) is a prevalent cause of mortality and morbidity. The molecular drivers of HF are still largely unknown. RESULTS: We aimed to identify circulating proteins causally associated with HF by leveraging genome-wide genetic association data for HF including 47,309 cases and 930,014 controls. We performed two-sample Mendelian randomization (MR) with multiple cis instruments as well as network and enrichment analysis using data from blood protein quantitative trait loci (pQTL) (2,965 blood proteins) measured in 3,301 individuals. Nineteen blood proteins were causally associated with HF, were not subject to reverse causality and were enriched in ligand-receptor and glycosylation molecules. Network pathway analysis of the blood proteins showed enrichment in NF-kappa B, TGF beta, lipid in atherosclerosis and fluid shear stress. Cross-phenotype analysis of HF identified genetic overlap with cardiovascular drugs, myocardial infarction, parental longevity and low-density cholesterol. Multi-trait MR identified causal associations between HF-associated blood proteins and cardiovascular outcomes. Multivariable MR showed that association of BAG3, MIF and APOA5 with HF were mediated by the blood pressure and coronary artery disease. According to the directional effect and biological action, 7 blood proteins are targets of existing drugs or are tractable for the development of novel therapeutics. Among the pathways, sialyl Lewis x and the activin type II receptor are potential druggable candidates. CONCLUSIONS: Integrative MR analyses of the blood proteins identified causally-associated proteins with HF and revealed pleiotropy of the blood proteome with cardiovascular risk factors. Some of the proteins or pathway related mechanisms could be targeted as novel treatment approach in HF.


Assuntos
Insuficiência Cardíaca , Proteoma , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Proteínas Reguladoras de Apoptose , Proteínas Sanguíneas/metabolismo , Insuficiência Cardíaca/genética , Humanos , Análise da Randomização Mendeliana , Proteoma/metabolismo , Fatores de Risco
6.
Genomics ; 112(2): 1734-1745, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-31678593

RESUMO

The Brucella melitensis chronic infection and drug resistance emerged as a severe health problem in humans and domestic cattle. The pathogens fast genome sequences availability fetched the possibility to address novel therapeutics targets in a rationale way. We acquired the core genes set from 56 B. melitensis publically available complete genome sequences. A stringent bioinformatics layout of comparative genomics and reverse vaccinology was followed to identify potential druggable proteins and multi-epitope vaccine constructs from core genes. The 23 proteins were shortlisted as novel druggable targets based on their role in pathogen-specific metabolic pathways, non-homologous to human and human gut microbiome proteins and their druggability potential. Furthermore, potential chimeric vaccine constructs were generated from lead T and B-cell overlapped epitopes in combination with immune enhancer adjuvants and linkers sequences. The molecular docking and MD simulation analyses ensured stable molecular interaction of a finally prioritized vaccine construct with human immune cells receptors.


Assuntos
Proteínas de Bactérias/química , Vacina contra Brucelose/química , Brucella melitensis/imunologia , Genoma Bacteriano , Linfócitos B/imunologia , Proteínas de Bactérias/genética , Proteínas de Bactérias/imunologia , Vacina contra Brucelose/genética , Vacina contra Brucelose/imunologia , Brucella melitensis/genética , Epitopos/química , Epitopos/imunologia , Humanos , Imunogenicidade da Vacina , Simulação de Acoplamento Molecular , Ligação Proteica , Linfócitos T/imunologia
7.
Int J Mol Sci ; 22(2)2021 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-33429995

RESUMO

We describe the assembly and annotation of a chemogenomic set of protein kinase inhibitors as an open science resource for studying kinase biology. The set only includes inhibitors that show potent kinase inhibition and a narrow spectrum of activity when screened across a large panel of kinase biochemical assays. Currently, the set contains 187 inhibitors that cover 215 human kinases. The kinase chemogenomic set (KCGS), current Version 1.0, is the most highly annotated set of selective kinase inhibitors available to researchers for use in cell-based screens.


Assuntos
Descoberta de Drogas , Inibidores de Proteínas Quinases/química , Proteínas Serina-Treonina Quinases/química , Bibliotecas de Moléculas Pequenas/química , Humanos , Inibidores de Proteínas Quinases/uso terapêutico , Proteínas Serina-Treonina Quinases/antagonistas & inibidores , Bibliotecas de Moléculas Pequenas/uso terapêutico , Relação Estrutura-Atividade
8.
BMC Cancer ; 19(1): 1005, 2019 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-31655559

RESUMO

BACKGROUND: Acute T-cell lymphoblastic leukaemia (T-ALL) is an aggressive disorder derived from immature thymocytes. The variability observed in clinical responses on this type of tumours to treatments, the high toxicity of current protocols and the poor prognosis of patients with relapse or refractory make it urgent to find less toxic and more effective therapies in the context of a personalized medicine of precision. METHODS: Whole exome sequencing and RNAseq were performed on DNA and RNA respectively, extracted of a bone marrow sample from a patient diagnosed with tumour primary T-ALL and double negative thymocytes from thymus control samples. We used PanDrugs, a computational resource to propose pharmacological therapies based on our experimental results, including lists of variants and genes. We extend the possible therapeutic options for the patient by taking into account multiple genomic events potentially sensitive to a treatment, the context of the pathway and the pharmacological evidence already known by large-scale experiments. RESULTS: As a proof-of-principle we used next-generation-sequencing technologies (Whole Exome Sequencing and RNA-Sequencing) in a case of diagnosed Pro-T acute lymphoblastic leukaemia. We identified 689 disease-causing mutations involving 308 genes, as well as multiple fusion transcript variants, alternative splicing, and 6652 genes with at least one principal isoform significantly deregulated. Only 12 genes, with 27 pathogenic gene variants, were among the most frequently mutated ones in this type of lymphoproliferative disorder. Among them, 5 variants detected in CTCF, FBXW7, JAK1, NOTCH1 and WT1 genes have not yet been reported in T-ALL pathogenesis. CONCLUSIONS: Personalized genomic medicine is a therapeutic approach involving the use of an individual's information data to tailor drug therapy. Implementing bioinformatics platform PanDrugs enables us to propose a prioritized list of anticancer drugs as the best theoretical therapeutic candidates to treat this patient has been the goal of this article. Of note, most of the proposed drugs are not being yet considered in the clinical practice of this type of cancer opening up the approach of new treatment possibilities.


Assuntos
Antineoplásicos/uso terapêutico , Genoma Humano/genética , Genômica/métodos , Medicina de Precisão/métodos , Leucemia-Linfoma Linfoblástico de Células T Precursoras/tratamento farmacológico , Adolescente , Processamento Alternativo/genética , Exoma/genética , Fusão Gênica/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Mutação/genética , RNA-Seq , Espanha , Transcriptoma/genética
9.
J Biol Chem ; 290(32): 19471-7, 2015 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-26100629

RESUMO

G-protein-coupled receptors (GPCRs) are frequent and fruitful targets for drug discovery and development, as well as being off-targets for the side effects of a variety of medications. Much of the druggable non-olfactory human GPCR-ome remains under-interrogated, and we present here various approaches that we and others have used to shine light into these previously dark corners of the human genome.


Assuntos
Drogas em Investigação/farmacologia , Genoma Humano , Proteoma/química , Receptores Acoplados a Proteínas G/metabolismo , Bibliotecas de Moléculas Pequenas/farmacologia , Sítios de Ligação , Bases de Dados Bibliográficas , Bases de Dados de Compostos Químicos , Bases de Dados de Produtos Farmacêuticos , Drogas em Investigação/síntese química , Humanos , Ligantes , Ligação Proteica , Proteoma/metabolismo , Receptores Acoplados a Proteínas G/agonistas , Receptores Acoplados a Proteínas G/antagonistas & inibidores , Receptores Acoplados a Proteínas G/genética , Bibliotecas de Moléculas Pequenas/síntese química
10.
Drug Discov Today ; 29(3): 103882, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38218214

RESUMO

The Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) project aims to aggregate, update, and articulate protein-centric data knowledge for the entire human proteome, with emphasis on the understudied proteins from the three IDG protein families. KMC collates and analyzes data from over 70 resources to compile the Target Central Resource Database (TCRD), which is the web-based informatics platform (Pharos). These data include experimental, computational, and text-mined information on protein structures, compound interactions, and disease and phenotype associations. Based on this knowledge, proteins are classified into different Target Development Levels (TDLs) for identification of understudied targets. Additional work by the KMC focuses on enriching target knowledge and producing DrugCentral and other data visualization tools for expanding investigation of understudied targets.


Assuntos
Genoma , Gestão do Conhecimento , Humanos , Proteoma , Bases de Dados Factuais , Informática
11.
Drug Discov Today ; 29(5): 103953, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38508231

RESUMO

The Illuminating the Druggable Genome (IDG) consortium generated reagents, biological model systems, data, informatic databases, and computational tools. The Resource Dissemination and Outreach Center (RDOC) played a central administrative role, organized internal meetings, fostered collaboration, and coordinated consortium-wide efforts. The RDOC developed and deployed a Resource Management System (RMS) to enable efficient workflows for collecting, accessing, validating, registering, and publishing resource metadata. IDG policies for repositories and standardized representations of resources were established, adopting the FAIR (findable, accessible, interoperable, reusable) principles. The RDOC also developed metrics of IDG impact. Outreach initiatives included digital content, the Protein Illumination Timeline (representing milestones in generating data and reagents), the Target Watch publication series, the e-IDG Symposium series, and leveraging social media platforms.


Assuntos
Disseminação de Informação , Humanos , Bases de Dados Factuais
12.
Drug Discov Today ; 29(3): 103881, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38218213

RESUMO

The human kinome, with more than 500 proteins, is crucial for cell signaling and disease. Yet, about one-third of kinases lack in-depth study. The Data and Resource Generating Center for Understudied Kinases has developed multiple resources to address this challenge including creation of a heavy amino acid peptide library for parallel reaction monitoring and quantitation of protein kinase expression, use of understudied kinases tagged with a miniTurbo-biotin ligase to determine interaction networks by proximity-dependent protein biotinylation, NanoBRET probe development for screening chemical tool target specificity in live cells, characterization of small molecule chemical tools inhibiting understudied kinases, and computational tools for defining kinome architecture. These resources are available through the Dark Kinase Knowledgebase, supporting further research into these understudied protein kinases.


Assuntos
Proteínas Quinases , Proteínas , Humanos , Proteínas Quinases/metabolismo , Proteômica
13.
BMC Med Genomics ; 17(1): 157, 2024 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-38862968

RESUMO

Primary Sclerosing Cholangitis (PSC) is a progressive cholestatic liver disease with no licensed therapies. Previous Genome Wide Association Studies (GWAS) have identified genes that correlate significantly with PSC, and these were identified by systematic review. Here we use novel Network Proximity Analysis (NPA) methods to identify already licensed candidate drugs that may have an effect on the genetically coded aspects of PSC pathophysiology.Over 2000 agents were identified as significantly linked to genes implicated in PSC by this method. The most significant results include previously researched agents such as metronidazole, as well as biological agents such as basiliximab, abatacept and belatacept. This in silico analysis could potentially serve as a basis for developing novel clinical trials in this rare disease.


Assuntos
Colangite Esclerosante , Colangite Esclerosante/tratamento farmacológico , Colangite Esclerosante/genética , Humanos , Estudo de Associação Genômica Ampla , Modelos Teóricos
14.
Drug Discov Today ; 29(3): 103848, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38052317

RESUMO

G-protein-coupled receptors (GPCRs) are the target of >30% of approved drugs. Despite their popularity, many of the >800 human GPCRs remain understudied. The Illuminating the Druggable Genome (IDG) project has generated many tools leading to important insights into the function and druggability of these so-called 'dark' receptors. These tools include assays, such as PRESTO-TANGO and TRUPATH, billions of small molecules made available via the ZINC virtual library, solved orphan GPCR structures, GPCR knock-in mice, and more. Together, these tools are illuminating the remaining 'dark' GPCRs.


Assuntos
Bioensaio , Receptores Acoplados a Proteínas G , Humanos , Animais , Camundongos , Receptores Acoplados a Proteínas G/química , Ligantes
15.
PeerJ ; 12: e17470, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38948230

RESUMO

TIN-X (Target Importance and Novelty eXplorer) is an interactive visualization tool for illuminating associations between diseases and potential drug targets and is publicly available at newdrugtargets.org. TIN-X uses natural language processing to identify disease and protein mentions within PubMed content using previously published tools for named entity recognition (NER) of gene/protein and disease names. Target data is obtained from the Target Central Resource Database (TCRD). Two important metrics, novelty and importance, are computed from this data and when plotted as log(importance) vs. log(novelty), aid the user in visually exploring the novelty of drug targets and their associated importance to diseases. TIN-X Version 3.0 has been significantly improved with an expanded dataset, modernized architecture including a REST API, and an improved user interface (UI). The dataset has been expanded to include not only PubMed publication titles and abstracts, but also full-text articles when available. This results in approximately 9-fold more target/disease associations compared to previous versions of TIN-X. Additionally, the TIN-X database containing this expanded dataset is now hosted in the cloud via Amazon RDS. Recent enhancements to the UI focuses on making it more intuitive for users to find diseases or drug targets of interest while providing a new, sortable table-view mode to accompany the existing plot-view mode. UI improvements also help the user browse the associated PubMed publications to explore and understand the basis of TIN-X's predicted association between a specific disease and a target of interest. While implementing these upgrades, computational resources are balanced between the webserver and the user's web browser to achieve adequate performance while accommodating the expanded dataset. Together, these advances aim to extend the duration that users can benefit from TIN-X while providing both an expanded dataset and new features that researchers can use to better illuminate understudied proteins.


Assuntos
Interface Usuário-Computador , Humanos , Processamento de Linguagem Natural , PubMed , Software
16.
PeerJ ; 11: e14927, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36874981

RESUMO

Background: Gene-gene co-expression correlations measured by mRNA-sequencing (RNA-seq) can be used to predict gene annotations based on the co-variance structure within these data. In our prior work, we showed that uniformly aligned RNA-seq co-expression data from thousands of diverse studies is highly predictive of both gene annotations and protein-protein interactions. However, the performance of the predictions varies depending on whether the gene annotations and interactions are cell type and tissue specific or agnostic. Tissue and cell type-specific gene-gene co-expression data can be useful for making more accurate predictions because many genes perform their functions in unique ways in different cellular contexts. However, identifying the optimal tissues and cell types to partition the global gene-gene co-expression matrix is challenging. Results: Here we introduce and validate an approach called PRediction of gene Insights from Stratified Mammalian gene co-EXPression (PrismEXP) for improved gene annotation predictions based on RNA-seq gene-gene co-expression data. Using uniformly aligned data from ARCHS4, we apply PrismEXP to predict a wide variety of gene annotations including pathway membership, Gene Ontology terms, as well as human and mouse phenotypes. Predictions made with PrismEXP outperform predictions made with the global cross-tissue co-expression correlation matrix approach on all tested domains, and training using one annotation domain can be used to predict annotations in other domains. Conclusions: By demonstrating the utility of PrismEXP predictions in multiple use cases we show how PrismEXP can be used to enhance unsupervised machine learning methods to better understand the roles of understudied genes and proteins. To make PrismEXP accessible, it is provided via a user-friendly web interface, a Python package, and an Appyter. AVAILABILITY. The PrismEXP web-based application, with pre-computed PrismEXP predictions, is available from: https://maayanlab.cloud/prismexp; PrismEXP is also available as an Appyter: https://appyters.maayanlab.cloud/PrismEXP/; and as Python package: https://github.com/maayanlab/prismexp.


Assuntos
Mamíferos , Humanos , Animais , Camundongos , Anotação de Sequência Molecular , Ontologia Genética , Fenótipo
17.
Curr Protoc ; 3(7): e845, 2023 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-37467006

RESUMO

Understudied or dark proteins have the potential to shed light on as-yet undiscovered molecular mechanisms that underlie phenotypes and suggest innovative therapeutic approaches for many diseases. The Reactome-IDG (Illuminating the Druggable Genome) project aims to place dark proteins in the context of manually curated, highly reliable pathways in Reactome, the most comprehensive, open-source biological pathway knowledgebase, facilitating the understanding functions and predicting therapeutic potentials of dark proteins. The Reactome-IDG web portal, deployed at https://idg.reactome.org, provides a simple, interactive web page for users to search pathways that may functionally interact with dark proteins, enabling the prediction of functions of dark proteins in the context of Reactome pathways. Enhanced visualization features implemented at the portal allow users to investigate the functional contexts for dark proteins based on tissue-specific gene or protein expression, drug-target interactions, or protein or gene pairwise relationships in the original Reactome's systems biology graph notation (SBGN) diagrams or the new simplified functional interaction (FI) network view of pathways. The protocols in this chapter describe step-by-step procedures to use the web portal to learn biological functions of dark proteins in the context of Reactome pathways. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Search for interacting pathways of a protein Support Protocol: Interacting pathway results for an annotated protein Alternate Protocol: Use individual pairwise relationships to predict interacting pathways of a protein Basic Protocol 2: Using the IDG pathway browser to study interacting pathways Basic Protocol 3: Overlaying tissue-specific expression data Basic Protocol 4: Overlaying protein/gene pairwise relationships in the pathway context Basic Protocol 5: Visualizing drug/target interactions.


Assuntos
Redes e Vias Metabólicas , Transdução de Sinais , Biologia de Sistemas/métodos , Proteômica , Proteínas/metabolismo
18.
PeerJ ; 11: e15153, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37151295

RESUMO

The patent literature is a potentially valuable source of bioactivity data. In this article we describe a process to prioritise 3.7 million life science relevant patents obtained from the SureChEMBL database (https://www.surechembl.org/), according to how likely they were to contain bioactivity data for potent small molecules on less-studied targets, based on the classification developed by the Illuminating the Druggable Genome (IDG) project. The overall goal was to select a smaller number of patents that could be manually curated and incorporated into the ChEMBL database. Using relatively simple annotation and filtering pipelines, we have been able to identify a substantial number of patents containing quantitative bioactivity data for understudied targets that had not previously been reported in the peer-reviewed medicinal chemistry literature. We quantify the added value of such methods in terms of the numbers of targets that are so identified, and provide some specific illustrative examples. Our work underlines the potential value in searching the patent corpus in addition to the more traditional peer-reviewed literature. The small molecules found in these patents, together with their measured activity against the targets, are now accessible via the ChEMBL database.


Assuntos
Química Farmacêutica , Descoberta de Drogas , Descoberta de Drogas/métodos , Bases de Dados Factuais
19.
Cell Rep ; 42(2): 112086, 2023 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-36790929

RESUMO

Ischemic cardiomyopathy (ICM) is the leading cause of heart failure worldwide, yet the cellular and molecular signature of this disease is largely unclear. Using single-nucleus RNA sequencing (snRNA-seq) and integrated computational analyses, we profile the transcriptomes of over 99,000 human cardiac nuclei from the non-infarct region of the left ventricle of 7 ICM transplant recipients and 8 non-failing (NF) controls. We find the cellular composition of the ischemic heart is significantly altered, with decreased cardiomyocytes and increased proportions of lymphatic, angiogenic, and arterial endothelial cells in patients with ICM. We show that there is increased LAMININ signaling from endothelial cells to other cell types in ICM compared with NF. Finally, we find that the transcriptional changes that occur in ICM are similar to those in hypertrophic and dilated cardiomyopathies and that the mining of these combined datasets can identify druggable genes that could be used to target end-stage heart failure.


Assuntos
Cardiomiopatias , Cardiomiopatia Dilatada , Insuficiência Cardíaca , Isquemia Miocárdica , Humanos , Células Endoteliais/metabolismo , Isquemia Miocárdica/genética , Isquemia Miocárdica/metabolismo , Insuficiência Cardíaca/genética , Insuficiência Cardíaca/metabolismo , Análise de Sequência de RNA , Cardiomiopatias/genética
20.
PeerJ ; 11: e15815, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37868056

RESUMO

The 534 protein kinases encoded in the human genome constitute a large druggable class of proteins that include both well-studied and understudied "dark" members. Accurate prediction of dark kinase functions is a major bioinformatics challenge. Here, we employ a graph mining approach that uses the evolutionary and functional context encoded in knowledge graphs (KGs) to predict protein and pathway associations for understudied kinases. We propose a new scalable graph embedding approach, RegPattern2Vec, which employs regular pattern constrained random walks to sample diverse aspects of node context within a KG flexibly. RegPattern2Vec learns functional representations of kinases, interacting partners, post-translational modifications, pathways, cellular localization, and chemical interactions from a kinase-centric KG that integrates and conceptualizes data from curated heterogeneous data resources. By contextualizing information relevant to prediction, RegPattern2Vec improves accuracy and efficiency in comparison to other random walk-based graph embedding approaches. We show that the predictions produced by our model overlap with pathway enrichment data produced using experimentally validated Protein-Protein Interaction (PPI) data from both publicly available databases and experimental datasets not used in training. Our model also has the advantage of using the collected random walks as biological context to interpret the predicted protein-pathway associations. We provide high-confidence pathway predictions for 34 dark kinases and present three case studies in which analysis of meta-paths associated with the prediction enables biological interpretation. Overall, RegPattern2Vec efficiently samples multiple node types for link prediction on biological knowledge graphs and the predicted associations between understudied kinases, pseudokinases, and known pathways serve as a conceptual starting point for hypothesis generation and testing.


Assuntos
Reconhecimento Automatizado de Padrão , Proteínas , Humanos , Proteínas/genética , Biologia Computacional , Aprendizagem , Conhecimento
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA