Search | BVS IEC

1.

DrugCentral 2023 extends human clinical data and integrates veterinary drugs.

Avram, Sorin; Wilson, Thomas B; Curpan, Ramona; Halip, Liliana; Borota, Ana; Bora, Alina; Bologa, Cristian G; Holmes, Jayme; Knockel, Jeffrey; Yang, Jeremy J; Oprea, Tudor I.

Nucleic Acids Res ; 51(D1): D1276-D1287, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36484092

ABSTRACT

DrugCentral monitors new drug approvals and standardizes drug information. The current update contains 285 drugs (131 for human use). New additions include: (i) the integration of veterinary drugs (154 for animal use only), (ii) the addition of 66 documented off-label uses and (iii) the identification of adverse drug events from pharmacovigilance data for pediatric and geriatric patients. Additional enhancements include chemical substructure searching using SMILES and 'Target Cards' based on UniProt accession codes. Statistics of interest include the following: (i) 60% of the covered drugs are on-market drugs with expired patent and exclusivity coverage, 17% are off-market, and 23% are on-market drugs with active patents and exclusivity coverage; (ii) 59% of the drugs are oral, 33% are parenteral and 18% topical, at the level of the active ingredients; (iii) only 3% of all drugs are for animal use only; however, 61% of the veterinary drugs are also approved for human use; (iv) dogs, cats and horses are by far the most represented target species for veterinary drugs; (v) the physicochemical property profile of animal drugs is very similar to that of human drugs. Use cases include azaperone, the only sedative approved for swine, and ruxolitinib, a Janus kinase inhibitor.

Subjects

Drug Approval; Drug-Related Side Effects and Adverse Reactions; Veterinary Drugs; Animals; Humans; Drug-Related Side Effects and Adverse Reactions/veterinary; Veterinary Drugs/administration & dosage; Veterinary Drugs/adverse effects; Off-Label Use/veterinary

2.

Pharos 2023: an integrated resource for the understudied human proteome.

Kelleher, Keith J; Sheils, Timothy K; Mathias, Stephen L; Yang, Jeremy J; Metzger, Vincent T; Siramshetty, Vishal B; Nguyen, Dac-Trung; Jensen, Lars Juhl; Vidovic, Dusica; Schürer, Stephan C; Holmes, Jayme; Sharma, Karlie R; Pillai, Ajay; Bologa, Cristian G; Edwards, Jeremy S; Mathé, Ewy A; Oprea, Tudor I.

Nucleic Acids Res ; 51(D1): D1405-D1416, 2023 01 06.

Article in English | MEDLINE | ID: mdl-36624666

ABSTRACT

The Illuminating the Druggable Genome (IDG) project aims to improve our understanding of understudied proteins and our ability to study them in the context of disease biology by perturbing them with small molecules, biologics, or other therapeutic modalities. Two main products from the IDG effort are the Target Central Resource Database (TCRD) (http://juniper.health.unm.edu/tcrd/), which curates and aggregates information, and Pharos (https://pharos.nih.gov/), a web interface for users to extract and visualize data from TCRD. Since the 2021 release, TCRD/Pharos has focused on developing visualization and analysis tools that help reveal higher-level patterns in the underlying data. The current iterations of TCRD and Pharos enable users to perform enrichment calculations based on subsets of targets, diseases, or ligands and to create interactive heat maps and UpSet charts of many types of annotations. Using several examples, we show how to address disease biology and drug discovery questions through enrichment calculations and UpSet charts.

Subjects

Databases, Factual; Molecular Targeted Therapy; Proteome; Humans; Biological Products; Drug Discovery; Internet; Proteome/drug effects

3.

DrugCentral 2021 supports drug discovery and repositioning.

Avram, Sorin; Bologa, Cristian G; Holmes, Jayme; Bocci, Giovanni; Wilson, Thomas B; Nguyen, Dac-Trung; Curpan, Ramona; Halip, Liliana; Bora, Alina; Yang, Jeremy J; Knockel, Jeffrey; Sirimulla, Suman; Ursu, Oleg; Oprea, Tudor I.

Nucleic Acids Res ; 49(D1): D1160-D1169, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33151287

ABSTRACT

DrugCentral is a public resource (http://drugcentral.org) that serves the scientific community by providing up-to-date drug information, as described in previous papers. The current release includes 109 newly approved (October 2018 through March 2020) active pharmaceutical ingredients in the US, Europe, Japan and other countries; and two molecular entities (e.g. mefuparib) of interest for COVID19. New additions include a set of pharmacokinetic properties for ~1000 drugs, and a sex-based separation of side effects, processed from FAERS (FDA Adverse Event Reporting System); as well as a drug repositioning prioritization scheme based on the market availability and intellectual property rights for FDA-approved drugs. In the context of the COVID19 pandemic, we also incorporated REDIAL-2020, a machine learning platform that estimates anti-SARS-CoV-2 activities, as well as the 'drugs in news' feature, which offers a brief enumeration of the most interesting drugs at the present moment. The full database dump and data files are available for download from the DrugCentral web portal.

Subjects

Antiviral Agents/therapeutic use; COVID-19 Drug Treatment; Databases, Pharmaceutical/statistics & numerical data; Drug Approval/statistics & numerical data; Drug Discovery/statistics & numerical data; Drug Repositioning/statistics & numerical data; SARS-CoV-2/drug effects; Antiviral Agents/adverse effects; Antiviral Agents/pharmacokinetics; COVID-19/epidemiology; COVID-19/virology; Drug Approval/methods; Drug Discovery/methods; Drug Repositioning/methods; Epidemics; Europe; Humans; Information Storage and Retrieval/methods; Internet; Japan; SARS-CoV-2/physiology; United States

4.

TCRD and Pharos 2021: mining the human proteome for disease biology.

Sheils, Timothy K; Mathias, Stephen L; Kelleher, Keith J; Siramshetty, Vishal B; Nguyen, Dac-Trung; Bologa, Cristian G; Jensen, Lars Juhl; Vidovic, Dusica; Koleti, Amar; Schürer, Stephan C; Waller, Anna; Yang, Jeremy J; Holmes, Jayme; Bocci, Giovanni; Southall, Noel; Dharkar, Poorva; Mathé, Ewy; Simeonov, Anton; Oprea, Tudor I.

Nucleic Acids Res ; 49(D1): D1334-D1346, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33156327

ABSTRACT

In 2014, the National Institutes of Health (NIH) initiated the Illuminating the Druggable Genome (IDG) program to identify and improve our understanding of poorly characterized proteins that can potentially be modulated using small molecules or biologics. Two resources produced from these efforts are: The Target Central Resource Database (TCRD) (http://juniper.health.unm.edu/tcrd/) and Pharos (https://pharos.nih.gov/), a web interface to browse the TCRD. The ultimate goal of these resources is to highlight and facilitate research into currently understudied proteins, by aggregating a multitude of data sources, and ranking targets based on the amount of data available, and presenting data in machine learning ready format. Since the 2017 release, both TCRD and Pharos have produced two major releases, which have incorporated or expanded an additional 25 data sources. Recently incorporated data types include human and viral-human protein-protein interactions, protein-disease and protein-phenotype associations, and drug-induced gene signatures, among others. These aggregated data have enabled us to generate new visualizations and content sections in Pharos, in order to empower users to find new areas of study in the druggable genome.

Subjects

Databases, Factual; Genome, Human; Neurodegenerative Diseases/genetics; Proteomics/methods; Software; Virus Diseases/genetics; Animals; Anticonvulsants/chemistry; Anticonvulsants/therapeutic use; Antiviral Agents/chemistry; Antiviral Agents/therapeutic use; Biological Products/chemistry; Biological Products/therapeutic use; Data Mining/statistics & numerical data; Host-Pathogen Interactions/drug effects; Host-Pathogen Interactions/genetics; Humans; Internet; Machine Learning/statistics & numerical data; Mice; Mice, Knockout; Molecular Targeted Therapy/methods; Neurodegenerative Diseases/classification; Neurodegenerative Diseases/drug therapy; Neurodegenerative Diseases/virology; Protein Interaction Mapping; Proteome/agonists; Proteome/antagonists & inhibitors; Proteome/genetics; Proteome/metabolism; Small Molecule Libraries/chemistry; Small Molecule Libraries/therapeutic use; Virus Diseases/classification; Virus Diseases/drug therapy; Virus Diseases/virology

5.

Interdependence and the cost of uncoordinated responses to COVID-19.

Holtz, David; Zhao, Michael; Benzell, Seth G; Cao, Cathy Y; Rahimian, Mohammad Amin; Yang, Jeremy; Allen, Jennifer; Collis, Avinash; Moehring, Alex; Sowrirajan, Tara; Ghosh, Dipayan; Zhang, Yunhao; Dhillon, Paramveer S; Nicolaides, Christos; Eckles, Dean; Aral, Sinan.

Proc Natl Acad Sci U S A ; 117(33): 19837-19843, 2020 08 18.

Article in English | MEDLINE | ID: mdl-32732433

ABSTRACT

Social distancing is the core policy response to coronavirus disease 2019 (COVID-19). But, as federal, state and local governments begin opening businesses and relaxing shelter-in-place orders worldwide, we lack quantitative evidence on how policies in one region affect mobility and social distancing in other regions and the consequences of uncoordinated regional policies adopted in the presence of such spillovers. To investigate this concern, we combined daily, county-level data on shelter-in-place policies with movement data from over 27 million mobile devices, social network connections among over 220 million Facebook users, daily temperature and precipitation data from 62,000 weather stations, and county-level census data on population demographics to estimate the geographic and social network spillovers created by regional policies across the United States. Our analysis shows that the contact patterns of people in a given region are significantly influenced by the policies and behaviors of people in other, sometimes distant, regions. When just one-third of a state's social and geographic peer states adopt shelter-in-place policies, it creates a reduction in mobility equal to the state's own policy decisions. These spillovers are mediated by peer travel and distancing behaviors in those states. A simple analytical model calibrated with our empirical estimates demonstrated that the "loss from anarchy" in uncoordinated state policies is increasing in the number of noncooperating states and the size of social and geographic spillovers. These results suggest a substantial cost of uncoordinated government responses to COVID-19 when people, ideas, and media move across borders.

Subjects

COVID-19/prevention & control; Coronavirus Infections/prevention & control; Cost-Benefit Analysis; Efficiency, Organizational; Logistic Models; Pandemics/prevention & control; Pneumonia, Viral/prevention & control; Quarantine/organization & administration; COVID-19/economics; Coronavirus Infections/economics; Demography/statistics & numerical data; Humans; Pandemics/economics; Physical Distancing; Pneumonia, Viral/economics; Quarantine/economics; Quarantine/methods; Social Media/statistics & numerical data; Transportation/statistics & numerical data; United States

6.

Knowledge graph analytics platform with LINCS and IDG for Parkinson's disease target illumination.

Yang, Jeremy J; Gessner, Christopher R; Duerksen, Joel L; Biber, Daniel; Binder, Jessica L; Ozturk, Murat; Foote, Brian; McEntire, Robin; Stirling, Kyle; Ding, Ying; Wild, David J.

BMC Bioinformatics ; 23(1): 37, 2022 Jan 12.

Article in English | MEDLINE | ID: mdl-35021991

ABSTRACT

BACKGROUND: LINCS, "Library of Integrated Network-based Cellular Signatures", and IDG, "Illuminating the Druggable Genome", are both NIH projects and consortia that have generated rich datasets for the study of the molecular basis of human health and disease. LINCS L1000 expression signatures provide unbiased systems/omics experimental evidence. IDG provides compiled and curated knowledge for illumination and prioritization of novel drug target hypotheses. Together, these resources can support a powerful new approach to identifying novel drug targets for complex diseases, such as Parkinson's disease (PD), which continues to inflict severe harm on human health, and resist traditional research approaches. RESULTS: Integrating LINCS and IDG, we built the Knowledge Graph Analytics Platform (KGAP) to support an important use case: identification and prioritization of drug target hypotheses for associated diseases. The KGAP approach includes strong semantics interpretable by domain scientists and a robust, high performance implementation of a graph database and related analytical methods. Illustrating the value of our approach, we investigated results from queries relevant to PD. Approved PD drug indications from IDG's resource DrugCentral were used as starting points for evidence paths exploring chemogenomic space via LINCS expression signatures for associated genes, evaluated as target hypotheses by integration with IDG. The KG-analytic scoring function was validated against a gold standard dataset of genes associated with PD as elucidated, published mechanism-of-action drug targets, also from DrugCentral. IDG's resource TIN-X was used to rank and filter KGAP results for novel PD targets, and one, SYNGR3 (Synaptogyrin-3), was manually investigated further as a case study and plausible new drug target for PD. CONCLUSIONS: The synergy of LINCS and IDG, via KG methods, empowers graph analytics methods for the investigation of the molecular basis of complex diseases, and specifically for identification and prioritization of novel drug targets. The KGAP approach enables downstream applications via integration with resources similarly aligned with modern KG methodology. The generality of the approach indicates that KGAP is applicable to many disease areas, in addition to PD, the focus of this paper.

Subjects

Parkinson Disease; Gene Library; Genome; Humans; Lighting; Parkinson Disease/drug therapy; Parkinson Disease/genetics; Pattern Recognition, Automated

7.

TIGA: target illumination GWAS analytics.

Yang, Jeremy J; Grissa, Dhouha; Lambert, Christophe G; Bologa, Cristian G; Mathias, Stephen L; Waller, Anna; Wild, David J; Jensen, Lars Juhl; Oprea, Tudor I.

Bioinformatics ; 37(21): 3865-3873, 2021 11 05.

Article in English | MEDLINE | ID: mdl-34086846

ABSTRACT

MOTIVATION: Genome-wide association studies can reveal important genotype-phenotype associations; however, data quality and interpretability issues must be addressed. For drug discovery scientists seeking to prioritize targets based on the available evidence, these issues go beyond the single study. RESULTS: Here, we describe rational ranking, filtering and interpretation of inferred gene-trait associations and data aggregation across studies by leveraging existing curation and harmonization efforts. Each gene-trait association is evaluated for confidence, with scores derived solely from aggregated statistics, linking a protein-coding gene and phenotype. We propose a method for assessing confidence in gene-trait associations from evidence aggregated across studies, including a bibliometric assessment of scientific consensus based on the iCite relative citation ratio, and meanRank scores, to aggregate multivariate evidence. This method, intended for drug target hypothesis generation, scoring and ranking, has been implemented as an analytical pipeline, available as open source, with public datasets of results, and a web application designed for usability by drug discovery scientists. AVAILABILITY AND IMPLEMENTATION: Web application, datasets and source code via https://unmtid-shinyapps.net/tiga/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Genome-Wide Association Study; Lighting; Genotype; Polymorphism, Single Nucleotide; Phenotype
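
The abstract above mentions meanRank scores as a way to aggregate multivariate evidence. A minimal sketch of that general idea follows: rank each gene-trait association separately on every evidence variable, then average the ranks. This is not the TIGA pipeline itself. The variable names (`n_study`, `pvalue_score`, `rcr`), the example values, and the tie-handling rule are assumptions chosen for illustration, and the published method may define or normalize meanRank differently.

```python
from statistics import mean


def rank_descending(values):
    """Return 1-based ranks (1 = largest value); tied values share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        # Extend the group while the next value ties with the current one.
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        average_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = average_rank
        pos = end + 1
    return ranks


def mean_rank(evidence):
    """Average the per-variable ranks of each association.

    evidence maps a (hypothetical) variable name to a list of values,
    one per association, where a higher value means stronger evidence.
    A lower mean rank therefore means stronger aggregated evidence.
    """
    per_variable = [rank_descending(values) for values in evidence.values()]
    return [mean(column) for column in zip(*per_variable)]


# Three hypothetical gene-trait associations scored on three variables.
evidence = {
    "n_study": [5, 2, 2],
    "pvalue_score": [8.0, 9.0, 3.0],
    "rcr": [1.0, 4.0, 2.0],
}
print(mean_rank(evidence))  # [2.0, 1.5, 2.5]
```

Averaging ranks rather than raw values keeps variables on very different scales (counts, p-value transforms, citation ratios) from dominating one another, which is one plausible reason to prefer a rank-based aggregate for ranking target hypotheses.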

8.

DrugCentral 2018: an update.

Ursu, Oleg; Holmes, Jayme; Bologa, Cristian G; Yang, Jeremy J; Mathias, Stephen L; Stathias, Vasileios; Nguyen, Dac-Trung; Schürer, Stephan; Oprea, Tudor.

Nucleic Acids Res ; 47(D1): D963-D970, 2019 01 08.

Article in English | MEDLINE | ID: mdl-30371892

ABSTRACT

DrugCentral is a drug information resource (http://drugcentral.org) open to the public since 2016 and previously described in the 2017 Nucleic Acids Research Database issue. Since the 2016 release, 103 new approved drugs were updated. The following new data sources have been included: Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS), FDA Orange Book information, L1000 gene perturbation profile distance/similarity matrices and estimated protonation constants. New and existing entries have been updated with the latest information from scientific literature, drug labels and external databases. The web interface has been updated to display and query new data. The full database dump and data files are available for download from the DrugCentral website.

Subjects

Databases, Pharmaceutical; Drug Approval/statistics & numerical data; Drug-Related Side Effects and Adverse Reactions; Gene Expression/drug effects; Pharmaceutical Preparations/classification; Proteins/classification

9.

edge2vec: Representation learning using edge semantics for biomedical knowledge discovery.

Gao, Zheng; Fu, Gang; Ouyang, Chunping; Tsutsui, Satoshi; Liu, Xiaozhong; Yang, Jeremy; Gessner, Christopher; Foote, Brian; Wild, David; Ding, Ying; Yu, Qi.

BMC Bioinformatics ; 20(1): 306, 2019 Jun 10.

Article in English | MEDLINE | ID: mdl-31238875

ABSTRACT

BACKGROUND: Representation learning provides new and powerful graph analytical approaches and tools for the highly valued data science challenge of mining knowledge graphs. Since previous graph analytical methods have mostly focused on homogeneous graphs, an important current challenge is extending this methodology for richly heterogeneous graphs and knowledge domains. The biomedical sciences are such a domain, reflecting the complexity of biology, with entities such as genes, proteins, drugs, diseases, and phenotypes, and relationships such as gene co-expression, biochemical regulation, and biomolecular inhibition or activation. Therefore, the semantics of edges and nodes are critical for representation learning and knowledge discovery in real world biomedical problems. RESULTS: In this paper, we propose the edge2vec model, which represents graphs considering edge semantics. An edge-type transition matrix is trained by an Expectation-Maximization approach, and a stochastic gradient descent model is employed to learn node embedding on a heterogeneous graph via the trained transition matrix. edge2vec is validated on three biomedical domain tasks: biomedical entity classification, compound-gene bioactivity prediction, and biomedical information retrieval. Results show that by considering edge-types into node embedding learning in heterogeneous graphs, edge2vec significantly outperforms state-of-the-art models on all three tasks. CONCLUSIONS: We propose this method for its added value relative to existing graph analytical methodology, and in the real world context of biomedical knowledge discovery applicability.

Subjects

Informatics/methods; Knowledge; Learning; Algorithms; Biomedical Research; Humans; Neural Networks, Computer; Semantics

10.

DrugCentral: online drug compendium.

Ursu, Oleg; Holmes, Jayme; Knockel, Jeffrey; Bologa, Cristian G; Yang, Jeremy J; Mathias, Stephen L; Nelson, Stuart J; Oprea, Tudor I.

Nucleic Acids Res ; 45(D1): D932-D939, 2017 01 04.

Article in English | MEDLINE | ID: mdl-27789690

ABSTRACT

DrugCentral (http://drugcentral.org) is an open-access online drug compendium. DrugCentral integrates structure, bioactivity, regulatory, pharmacologic actions and indications for active pharmaceutical ingredients approved by FDA and other regulatory agencies. Monitoring of regulatory agencies for new drug approvals ensures the resource is up-to-date. DrugCentral integrates content for active ingredients with pharmaceutical formulations, indexing drugs and drug label annotations, complementing similar resources available online. Its complementarity with other online resources is facilitated by cross referencing to external resources. At the molecular level, DrugCentral bridges drug-target interactions with pharmacological action and indications. The integration with FDA drug labels enables text mining applications for drug adverse events and clinical trial information. Chemical structure overlap between DrugCentral and five online drug resources, and the overlap between DrugCentral FDA-approved drugs and their presence in four different chemical collections, are discussed. DrugCentral can be accessed via the web application or downloaded in relational database format.

Subjects

Databases, Pharmaceutical; Search Engine; Web Browser; Drug Approval; Drug Compounding; Drug Interactions; Drug Labeling; Drug-Related Side Effects and Adverse Reactions; Humans; Pharmaceutical Preparations/chemistry; United States; United States Food and Drug Administration

11.

Pharos: Collating protein information to shed light on the druggable genome.

Nguyen, Dac-Trung; Mathias, Stephen; Bologa, Cristian; Brunak, Soren; Fernandez, Nicolas; Gaulton, Anna; Hersey, Anne; Holmes, Jayme; Jensen, Lars Juhl; Karlsson, Anneli; Liu, Guixia; Ma'ayan, Avi; Mandava, Geetha; Mani, Subramani; Mehta, Saurabh; Overington, John; Patel, Juhee; Rouillard, Andrew D; Schürer, Stephan; Sheils, Timothy; Simeonov, Anton; Sklar, Larry A; Southall, Noel; Ursu, Oleg; Vidovic, Dusica; Waller, Anna; Yang, Jeremy; Jadhav, Ajit; Oprea, Tudor I; Guha, Rajarshi.

Nucleic Acids Res ; 45(D1): D995-D1002, 2017 01 04.

Article in English | MEDLINE | ID: mdl-27903890

ABSTRACT

The 'druggable genome' encompasses several protein families, but only a subset of targets within them have attracted significant research attention and thus have information about them publicly available. The Illuminating the Druggable Genome (IDG) program, initiated in 2014, has the goal of developing experimental techniques and a Knowledge Management Center (KMC) that would collect and organize information about protein targets from four families, representing the most common druggable targets with an emphasis on understudied proteins. Here, we describe two resources developed by the KMC: the Target Central Resource Database (TCRD) which collates many heterogeneous gene/protein datasets and Pharos (https://pharos.nih.gov), a multimodal web interface that presents the data from TCRD. We briefly describe the types and sources of data considered by the KMC and then highlight features of the Pharos interface designed to enable intuitive access to the IDG knowledgebase. The aim of Pharos is to encourage 'serendipitous browsing', whereby related, relevant information is made easily discoverable. We conclude by describing two use cases that highlight the utility of Pharos and TCRD.

Subjects

Databases, Genetic; Drug Discovery; Genomics; Pharmacogenetics; Search Engine; Cluster Analysis; Computational Biology/methods; Drug Discovery/methods; Genomics/methods; Humans; Obesity/drug therapy; Obesity/genetics; Obesity/metabolism; Pharmacogenetics/methods; Software; Web Browser

12.

TIN-X: target importance and novelty explorer.

Cannon, Daniel C; Yang, Jeremy J; Mathias, Stephen L; Ursu, Oleg; Mani, Subramani; Waller, Anna; Schürer, Stephan C; Jensen, Lars Juhl; Sklar, Larry A; Bologa, Cristian G; Oprea, Tudor I.

Bioinformatics ; 33(16): 2601-2603, 2017 Aug 15.

Article in English | MEDLINE | ID: mdl-28398460

ABSTRACT

MOTIVATION: The increasing amount of peer-reviewed manuscripts requires the development of specific mining tools to facilitate the visual exploration of evidence linking diseases and proteins. RESULTS: We developed TIN-X, the Target Importance and Novelty eXplorer, to visualize the association between proteins and diseases, based on text mining data processed from scientific literature. In the current implementation, TIN-X supports exploration of data for G-protein coupled receptors, kinases, ion channels, and nuclear receptors. TIN-X supports browsing and navigating across proteins and diseases based on ontology classes, and displays a scatter plot with two proposed new bibliometric statistics: Importance and Novelty. AVAILABILITY AND IMPLEMENTATION: http://www.newdrugtargets.org. CONTACT: cbologa@salud.unm.edu.

Subjects

Data Mining/methods; Disease/etiology; Software; Biological Ontologies; Computer Graphics; Humans; Ion Channels/metabolism; Phosphotransferases/metabolism; Receptors, Cytoplasmic and Nuclear/metabolism; Receptors, G-Protein-Coupled/metabolism

13.

User Centered Rare Disease Clinical Trial Knowledge Graph (RCTKG).

Yang, Jeremy Parker; Leadman, Devon; Ballew, Richard M; Sid, Eric; Xu, Yanji; Mathé, Ewy A; Zhu, Qian.

Stud Health Technol Inform ; 310: 94-98, 2024 Jan 25.

Article in English | MEDLINE | ID: mdl-38269772

ABSTRACT

Drug development in rare diseases is challenging due to the limited availability of subjects with the diseases and the difficulty of recruiting from a small patient population. The high cost and low success rate of clinical trials motivate deliberate analysis of existing clinical trials to understand the status of clinical development of orphan drugs and discover new insights for new trials. In this project, we aim to develop a user centered Rare disease based Clinical Trial Knowledge Graph (RCTKG) to integrate publicly available clinical trial data with rare diseases from the Genetic and Rare Disease (GARD) program in a semantic and standardized form for public use. To better serve and represent the interests of rare disease users, user stories were defined for three types of users, patients, healthcare providers and informaticians, to guide the RCTKG design in supporting the GARD program at NCATS/NIH and the broad clinical/research community in rare diseases.

Subjects

Pattern Recognition, Automated; Rare Diseases; Humans; Rare Diseases/drug therapy; Rare Diseases/genetics; Health Personnel; Knowledge

14.

Overview of the Knowledge Management Center for Illuminating the Druggable Genome.

Oprea, Tudor I; Bologa, Cristian; Holmes, Jayme; Mathias, Stephen; Metzger, Vincent T; Waller, Anna; Yang, Jeremy J; Leach, Andrew R; Jensen, Lars Juhl; Kelleher, Keith J; Sheils, Timothy K; Mathé, Ewy; Avram, Sorin; Edwards, Jeremy S.

Drug Discov Today ; 29(3): 103882, 2024 Mar.

Article in English | MEDLINE | ID: mdl-38218214

ABSTRACT

The Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) project aims to aggregate, update, and articulate protein-centric data knowledge for the entire human proteome, with emphasis on the understudied proteins from the three IDG protein families. KMC collates and analyzes data from over 70 resources to compile the Target Central Resource Database (TCRD), which is the web-based informatics platform (Pharos). These data include experimental, computational, and text-mined information on protein structures, compound interactions, and disease and phenotype associations. Based on this knowledge, proteins are classified into different Target Development Levels (TDLs) for identification of understudied targets. Additional work by the KMC focuses on enriching target knowledge and producing DrugCentral and other data visualization tools for expanding investigation of understudied targets.

Subjects

Genome; Knowledge Management; Humans; Proteome; Databases, Factual; Informatics

15.

Node-degree aware edge sampling mitigates inflated classification performance in biomedical random walk-based graph representation learning.

Cappelletti, Luca; Rekerle, Lauren; Fontana, Tommaso; Hansen, Peter; Casiraghi, Elena; Ravanmehr, Vida; Mungall, Christopher J; Yang, Jeremy J; Spranger, Leonard; Karlebach, Guy; Caufield, J Harry; Carmody, Leigh; Coleman, Ben; Oprea, Tudor I; Reese, Justin; Valentini, Giorgio; Robinson, Peter N.

Bioinform Adv ; 4(1): vbae036, 2024.

Article in English | MEDLINE | ID: mdl-38577542

ABSTRACT

Motivation: Graph representation learning is a family of related approaches that learn low-dimensional vector representations of nodes and other graph elements called embeddings. Embeddings approximate characteristics of the graph and can be used for a variety of machine-learning tasks such as novel edge prediction. For many biomedical applications, partial knowledge exists about positive edges that represent relationships between pairs of entities, but little to no knowledge is available about negative edges that represent the explicit lack of a relationship between two nodes. For this reason, classification procedures are forced to assume that the vast majority of unlabeled edges are negative. Existing approaches to sampling negative edges for training and evaluating classifiers do so by uniformly sampling pairs of nodes. Results: We show here that this sampling strategy typically leads to sets of positive and negative examples with imbalanced node degree distributions. Using representative heterogeneous biomedical knowledge graph and random walk-based graph machine learning, we show that this strategy substantially impacts classification performance. If users of graph machine-learning models apply the models to prioritize examples that are drawn from approximately the same distribution as the positive examples are, then performance of models as estimated in the validation phase may be artificially inflated. We present a degree-aware node sampling approach that mitigates this effect and is simple to implement. Availability and implementation: Our code and data are publicly available at https://github.com/monarch-initiative/negativeExampleSelection.
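
The contrast drawn in the abstract above, between uniform sampling of node pairs and a degree-aware alternative, can be sketched as follows. This is only one plausible reading and not the authors' implementation (their code is at the GitHub URL given in the abstract): here the degree-aware mode draws each endpoint with probability proportional to its degree among the positive edges, so sampled negatives involve high-degree nodes about as often as positives do. The function name `sample_negatives`, its parameters, and the toy graph are assumptions made for illustration.

```python
import random


def sample_negatives(positive_edges, n, degree_aware=True, seed=0):
    """Sample up to n undirected node pairs that are not positive edges.

    degree_aware=False draws both endpoints uniformly over nodes.
    degree_aware=True draws endpoints in proportion to node degree
    (an assumed reading of "degree-aware" sampling, for illustration).
    """
    rng = random.Random(seed)
    existing = {frozenset(edge) for edge in positive_edges}
    # Each node appears here once per incident positive edge, so a uniform
    # draw from this list is a degree-proportional draw over nodes.
    endpoints = [node for edge in positive_edges for node in edge]
    nodes = sorted(set(endpoints))
    pool = endpoints if degree_aware else nodes

    negatives = set()
    tries = 0
    max_tries = 1000 * n  # guard against graphs with too few non-edges
    while len(negatives) < n and tries < max_tries:
        tries += 1
        u, v = rng.choice(pool), rng.choice(pool)
        pair = frozenset((u, v))
        if u == v or pair in existing:
            continue  # skip self-loops and known positive edges
        negatives.add(pair)
    return sorted(tuple(sorted(pair)) for pair in negatives)


# Toy star graph: hub "h" linked to six leaves; only leaf-leaf pairs are non-edges.
edges = [("h", leaf) for leaf in "abcdef"]
print(sample_negatives(edges, 5, degree_aware=True))
print(sample_negatives(edges, 5, degree_aware=False))
```

Comparing the degree distribution of the sampled negatives with that of the positives, for both modes, is a quick way to check whether a given evaluation set has the imbalance the abstract describes.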

16.

Computational drug repositioning identifies niclosamide and tribromsalan as inhibitors of Mycobacterium tuberculosis and Mycobacterium abscessus.

Yang, Jeremy J; Goff, Aaron; Wild, David J; Ding, Ying; Annis, Ayano; Kerber, Randy; Foote, Brian; Passi, Anurag; Duerksen, Joel L; London, Shelley; Puhl, Ana C; Lane, Thomas R; Braunstein, Miriam; Waddell, Simon J; Ekins, Sean.

Tuberculosis (Edinb) ; 146: 102500, 2024 May.

Article in English | MEDLINE | ID: mdl-38432118

ABSTRACT

Tuberculosis (TB) is still a major global health challenge, killing over 1.5 million people each year, and hence, there is a need to identify and develop novel treatments for Mycobacterium tuberculosis (M. tuberculosis). The prevalence of infections caused by nontuberculous mycobacteria (NTM) is also increasing and has overtaken TB cases in the United States and much of the developed world. Mycobacterium abscessus (M. abscessus) is one of the most frequently encountered NTM and is difficult to treat. We describe the use of drug-disease association using a semantic knowledge graph approach combined with machine learning models that has enabled the identification of several molecules for testing anti-mycobacterial activity. We established that niclosamide (M. tuberculosis IC90 2.95 µM; M. abscessus IC90 59.1 µM) and tribromsalan (M. tuberculosis IC90 76.92 µM; M. abscessus IC90 147.4 µM) inhibit M. tuberculosis and M. abscessus in vitro. To investigate the mode of action, we determined the transcriptional response of M. tuberculosis and M. abscessus to both compounds in axenic log phase, demonstrating a broad effect on gene expression that differed from known M. tuberculosis inhibitors. Both compounds elicited transcriptional responses indicative of respiratory pathway stress and the dysregulation of fatty acid metabolism.

Subjects

Nontuberculous Mycobacterium Infections , Mycobacterium abscessus , Mycobacterium tuberculosis , Salicylanilides , Tuberculosis , Humans , Mycobacterium tuberculosis/genetics , Nontuberculous Mycobacterium Infections/microbiology , Niclosamide/pharmacology , Drug Repositioning , Nontuberculous Mycobacteria/genetics , Tuberculosis/drug therapy , Tuberculosis/microbiology

17.

TIN-X version 3: update with expanded dataset and modernized architecture for enhanced illumination of understudied targets.

Metzger, Vincent T; Cannon, Daniel C; Yang, Jeremy J; Mathias, Stephen L; Bologa, Cristian G; Waller, Anna; Schürer, Stephan C; Vidovic, Dusica; Kelleher, Keith J; Sheils, Timothy K; Jensen, Lars Juhl; Lambert, Christophe G; Oprea, Tudor I; Edwards, Jeremy S.

PeerJ ; 12: e17470, 2024.

Article in English | MEDLINE | ID: mdl-38948230

ABSTRACT

TIN-X (Target Importance and Novelty eXplorer) is an interactive visualization tool for illuminating associations between diseases and potential drug targets and is publicly available at newdrugtargets.org. TIN-X uses natural language processing to identify disease and protein mentions within PubMed content using previously published tools for named entity recognition (NER) of gene/protein and disease names. Target data is obtained from the Target Central Resource Database (TCRD). Two important metrics, novelty and importance, are computed from this data and, when plotted as log(importance) vs. log(novelty), aid the user in visually exploring the novelty of drug targets and their associated importance to diseases. TIN-X Version 3.0 has been significantly improved with an expanded dataset, modernized architecture including a REST API, and an improved user interface (UI). The dataset has been expanded to include not only PubMed publication titles and abstracts, but also full-text articles when available. This results in approximately 9-fold more target/disease associations compared to previous versions of TIN-X. Additionally, the TIN-X database containing this expanded dataset is now hosted in the cloud via Amazon RDS. Recent enhancements to the UI focus on making it more intuitive for users to find diseases or drug targets of interest while providing a new, sortable table-view mode to accompany the existing plot-view mode. UI improvements also help the user browse the associated PubMed publications to explore and understand the basis of TIN-X's predicted association between a specific disease and a target of interest. While implementing these upgrades, computational resources are balanced between the webserver and the user's web browser to achieve adequate performance while accommodating the expanded dataset. Together, these advances aim to extend the duration that users can benefit from TIN-X while providing both an expanded dataset and new features that researchers can use to better illuminate understudied proteins.

Subjects

User-Computer Interface , Humans , Natural Language Processing , PubMed , Software

18.

Concurrent Pediatric Lingual and Submental Dermoid Cysts: Case Report and Literature Review.

Gleichmann, Natasha; Creighton, Elizabeth; Zhu, Austin; Willard, Nicholas; Yang, Jeremy; Herrmann, Brian W.

Cureus ; 15(7): e42429, 2023 Jul.

Article in English | MEDLINE | ID: mdl-37637563

ABSTRACT

This pediatric case report describes the novel finding of concurrent submental and lingual dermoid cysts, which, to our knowledge, has not been previously reported in the literature. The etiology of cysts involving the tongue, floor of the mouth, and submental neck is varied, representing congenital, inflammatory, and neoplastic sources. Dermoid cysts involving these regions are uncommon and are most frequently reported in the submental, sublingual, and lingual spaces. Presenting symptoms vary with cyst size and position relative to the mylohyoid muscle. MRI is the preferred modality to differentiate dermoid cysts from other etiologies. While interventional techniques have been utilized to treat dermoid cysts in other head and neck locations, surgical excision remains the preferred treatment for those involving oral and floor-of-mouth structures.

19.

Toxicology knowledge graph for structural birth defects.

Evangelista, John Erol; Clarke, Daniel J B; Xie, Zhuorui; Marino, Giacomo B; Utti, Vivian; Jenkins, Sherry L; Ahooyi, Taha Mohseni; Bologa, Cristian G; Yang, Jeremy J; Binder, Jessica L; Kumar, Praveen; Lambert, Christophe G; Grethe, Jeffrey S; Wenger, Eric; Taylor, Deanne; Oprea, Tudor I; de Bono, Bernard; Ma'ayan, Avi.

Commun Med (Lond) ; 3(1): 98, 2023 Jul 17.

Article in English | MEDLINE | ID: mdl-37460679

ABSTRACT

BACKGROUND: Birth defects are functional and structural abnormalities that impact about 1 in 33 births in the United States. They have been attributed to genetic and other factors such as drugs, cosmetics, food, and environmental pollutants during pregnancy, but for most birth defects there are no known causes. METHODS: To further characterize associations between small molecule compounds and their potential to induce specific birth abnormalities, we gathered knowledge from multiple sources to construct a reproductive toxicity Knowledge Graph (ReproTox-KG) with a focus on associations between birth defects, drugs, and genes. Specifically, we gathered data from drug/birth-defect associations from co-mentions in published abstracts, gene/birth-defect associations from genetic studies, drug- and preclinical-compound-induced gene expression changes in cell lines, known drug targets, genetic burden scores for human genes, and placental crossing scores for small molecules. RESULTS: Using ReproTox-KG and semi-supervised learning (SSL), we scored >30,000 preclinical small molecules for their potential to cross the placenta and induce birth defects, and identified >500 birth-defect/gene/drug cliques that can be used to explain molecular mechanisms for drug-induced birth defects. The ReproTox-KG can be accessed via a web-based user interface available at https://maayanlab.cloud/reprotox-kg . This site enables users to explore the associations between birth defects, approved and preclinical drugs, and all human genes. CONCLUSIONS: ReproTox-KG provides a resource for exploring knowledge about the molecular mechanisms of birth defects with the potential of predicting the likelihood of genes and preclinical small molecules to induce birth defects.

While birth defects are common, for most birth defects there are no known causes. During pregnancy, developing babies are exposed to drugs, cosmetics, food, and environmental pollutants that may cause birth defects. However, exactly how these environmental factors are involved in producing birth defects is difficult to discern. Also, birth defects can be a consequence of the genes inherited from the parents. We combined general data about human genes and drugs with specific data previously implicating genes and drugs in inducing birth defects to create a knowledge graph representation that connects genes, drugs, and birth defects. This knowledge graph can be used to explore new links that may explain why birth defects occur, particularly those that result from a combination of inherited and environmental influences.
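The birth-defect/gene/drug cliques mentioned in the abstract can be read as triples in which all three pairwise associations are present. The following is a minimal sketch of that idea only, not the ReproTox-KG pipeline. The function name `find_cliques` and all identifiers in the toy data are hypothetical placeholders, and the edge sets are assumed to be given as simple pairs.

```python
def find_cliques(defect_gene, defect_drug, drug_gene):
    """Return (defect, gene, drug) triples where all three pairwise links exist.

    Each argument is an iterable of pairs:
      defect_gene: (defect, gene) associations
      defect_drug: (defect, drug) associations
      drug_gene:   (drug, gene) associations
    """
    drug_gene = set(drug_gene)
    # Index the drugs associated with each defect for quick lookup.
    drugs_by_defect = {}
    for defect, drug in defect_drug:
        drugs_by_defect.setdefault(defect, set()).add(drug)
    cliques = []
    for defect, gene in sorted(set(defect_gene)):
        for drug in sorted(drugs_by_defect.get(defect, ())):
            # The triple is a clique only if the drug-gene link also exists.
            if (drug, gene) in drug_gene:
                cliques.append((defect, gene, drug))
    return cliques


if __name__ == "__main__":
    # Hypothetical placeholder identifiers, not real associations.
    defect_gene = [("defect_A", "gene_1"), ("defect_A", "gene_2"), ("defect_B", "gene_2")]
    defect_drug = [("defect_A", "drug_X"), ("defect_B", "drug_Y")]
    drug_gene = [("drug_X", "gene_1"), ("drug_Y", "gene_1")]
    print(find_cliques(defect_gene, defect_drug, drug_gene))
```

A triple found this way is supported by three independent lines of evidence, which is why such cliques are useful candidates for mechanistic explanations.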

20.

Machine learning prediction and tau-based screening identifies potential Alzheimer's disease genes relevant to immunity.

Binder, Jessica; Ursu, Oleg; Bologa, Cristian; Jiang, Shanya; Maphis, Nicole; Dadras, Somayeh; Chisholm, Devon; Weick, Jason; Myers, Orrin; Kumar, Praveen; Yang, Jeremy J; Bhaskar, Kiran; Oprea, Tudor I.

Commun Biol ; 5(1): 125, 2022 02 11.

Article in English | MEDLINE | ID: mdl-35149761

ABSTRACT

With increased research funding for Alzheimer's disease (AD) and related disorders across the globe, large amounts of data are being generated. Several studies employed machine learning methods to understand the ever-growing omics data to enhance early diagnosis, map complex disease networks, or uncover potential drug targets. We describe results based on a Target Central Resource Database protein knowledge graph and evidence paths transformed into vectors by metapath matching. We extracted features between specific genes and diseases, then trained and optimized our model using XGBoost, termed MPxgb(AD). To determine our MPxgb(AD) prediction performance, we examined the top twenty predicted genes through an experimental screening pipeline. Our analysis identified potential AD risk genes: FRRS1, CTRAM, SCGB3A1, FAM92B/CIBAR2, and TMEFF2. FRRS1 and FAM92B are considered dark genes, while CTRAM, SCGB3A1, and TMEFF2 are connected to TREM2-TYROBP, IL-1β-TNFα, and MTOR-APP AD-risk nodes, suggesting relevance to the pathogenesis of AD.

Subjects

Alzheimer Disease , Alzheimer Disease/diagnosis , Alzheimer Disease/genetics , Alzheimer Disease/metabolism , Early Diagnosis , Humans , Machine Learning , Membrane Proteins/metabolism , Neoplasm Proteins
