Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 99
Filtrar
1.
Nucleic Acids Res ; 52(D1): D1508-D1518, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37897343

RESUMO

Knowledge of the collective activities of individual plants together with the derived clinical effects and targeted disease associations is useful for plant-based biomedical research. To provide the information in complement to the established databases, we introduced a major update of CMAUP database, previously featured in NAR. This update includes (i) human transcriptomic changes overlapping with 1152 targets of 5765 individual plants, covering 74 diseases from 20 027 patient samples; (ii) clinical information for 185 individual plants in 691 clinical trials; (iii) drug development information for 4694 drug-producing plants with metabolites developed into approved or clinical trial drugs; (iv) plant and human disease associations (428 737 associations by target, 220 935 reversion of transcriptomic changes, 764 and 154121 associations by clinical trials of individual plants and plant ingredients); (v) the location of individual plants in the phylogenetic tree for navigating taxonomic neighbors, (vi) DNA barcodes of 3949 plants, (vii) predicted human oral bioavailability of plant ingredients by the established SwissADME and HobPre algorithm, (viii) 21-107% increase of CMAUP data over the previous version to cover 60 222 chemical ingredients, 7865 plants, 758 targets, 1399 diseases, 238 KEGG human pathways, 3013 gene ontologies and 1203 disease ontologies. CMAUP update version is freely accessible at https://bidd.group/CMAUP/index.html.


Assuntos
Bases de Dados Factuais , Compostos Fitoquímicos , Plantas Medicinais , Humanos , Filogenia , Plantas Medicinais/química , Plantas Medicinais/classificação , Compostos Fitoquímicos/química , Compostos Fitoquímicos/farmacologia , Compostos Fitoquímicos/uso terapêutico
2.
Nucleic Acids Res ; 51(D1): D621-D628, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36624664

RESUMO

Quantitative activity and species source data of natural products (NPs) are important for drug discovery, medicinal plant research, and microbial investigations. Activity values of NPs against specific targets are useful for discovering targeted therapeutic agents and investigating the mechanism of medicinal plants. Composition/concentration values of NPs in individual species facilitate the assessments and investigations of the therapeutic quality of herbs and phenotypes of microbes. Here, we describe an update of the NPASS natural product activity and species source database previously featured in NAR. This update includes: (i) new data of ∼95 000 records of the composition/concentration values of ∼1 490 NPs/NP clusters in ∼390 species, (ii) extended data of activity values of ∼43 200 NPs against ∼7 700 targets (∼40% and ∼32% increase, respectively), (iii) extended data of ∼31 600 species sources of ∼94 400 NPs (∼26% and ∼32% increase, respectively), (iv) new species types of ∼440 co-cultured microbes and ∼420 engineered microbes, (v) new data of ∼66 600 NPs without experimental activity values but with estimated activity profiles from the established chemical similarity tool Chemical Checker, (vi) new data of the computed drug-likeness properties and the absorption, distribution, metabolism, excretion and toxicity (ADMET) properties for all NPs. NPASS update version is freely accessible at http://bidd.group/NPASS.


Assuntos
Produtos Biológicos , Pesquisa Biomédica , Bases de Dados Factuais , Descoberta de Drogas , Preparações Farmacêuticas/isolamento & purificação
3.
Nucleic Acids Res ; 50(8): e45, 2022 05 06.
Artigo em Inglês | MEDLINE | ID: mdl-35100418

RESUMO

Omics-based biomedical learning frequently relies on data of high-dimensions (up to thousands) and low-sample sizes (dozens to hundreds), which challenges efficient deep learning (DL) algorithms, particularly for low-sample omics investigations. Here, an unsupervised novel feature aggregation tool AggMap was developed to Aggregate and Map omics features into multi-channel 2D spatial-correlated image-like feature maps (Fmaps) based on their intrinsic correlations. AggMap exhibits strong feature reconstruction capabilities on a randomized benchmark dataset, outperforming existing methods. With AggMap multi-channel Fmaps as inputs, newly-developed multi-channel DL AggMapNet models outperformed the state-of-the-art machine learning models on 18 low-sample omics benchmark tasks. AggMapNet exhibited better robustness in learning noisy data and disease classification. The AggMapNet explainable module Simply-explainer identified key metabolites and proteins for COVID-19 detections and severity predictions. The unsupervised AggMap algorithm of good feature restructuring abilities combined with supervised explainable AggMapNet architecture establish a pipeline for enhanced learning and interpretability of low-sample omics data.


Assuntos
COVID-19 , Aprendizado Profundo , Algoritmos , Humanos , Aprendizado de Máquina , Proteínas
4.
Planta ; 258(3): 58, 2023 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-37528331

RESUMO

Extensive spaceflight life investigations (SLIs) have revealed observable space effects on plants, particularly their growth, nutrition yield, and secondary metabolite production. Knowledge of these effects not only facilitates space agricultural and biopharmaceutical technology development but also provides unique perspectives to ground-based investigations. SLIs are specialized experimental protocols and notable biological phenomena. These require specialized databases, leading to the development of the NASA Science Data Archive, Erasmus Experiment Archive, and NASA GeneLab. The increasing interests of SLIs across diverse fields demand resources with comprehensive content, convenient search facilities, and friendly information presentation. A new database SpaceLID (Space Life Investigation Database http://bidd.group/spacelid/ ) was developed with detailed menu search tools and categorized contents about the phenomena, protocols, and outcomes of 459 SLIs (including 106 plant investigations) of 92 species, where 236 SLIs and 57 plant investigations are uncovered by the existing databases. The usefulness of SpaceLID as an SLI information source is illustrated by the literature-reported analysis of metabolite, nutrition, and symbiosis variations of spaceflight plants. In conclusion, this study extensively investigated the impact of the space environment on plant biology, utilizing SpaceLID as an information source and examining various plant species, including Arabidopsis thaliana, Brassica rapa L., and Glycyrrhiza uralensis Fisch. The findings provide valuable insights into the effects of space conditions on plant physiology and metabolism.


Assuntos
Arabidopsis , Brassica rapa , Voo Espacial , Ausência de Peso , Plantas , Biologia
5.
J Chem Inf Model ; 63(15): 4615-4622, 2023 08 14.
Artigo em Inglês | MEDLINE | ID: mdl-37531205

RESUMO

Infrared (IR) spectroscopy is a powerful and versatile tool for analyzing functional groups in organic compounds. A complex and time-consuming interpretation of massive unknown spectra usually requires knowledge of chemistry and spectroscopy. This paper presents a new deep learning method for transforming IR spectral features into intuitive imagelike feature maps and prediction of major functional groups. We obtained 8272 gas-phase IR spectra from the NIST Chemistry WebBook. Feature maps are constructed using the intrinsic correlation of spectral data, and prediction models are developed based on convolutional neural networks. Twenty-one major functional groups for each molecule are successfully identified using binary and multilabel models without expert guidance and feature selection. The multilabel classification model can produce all prediction results simultaneously for rapid characterization. Further analysis of the detailed substructures indicates that our model is capable of obtaining abundant structural information from IR spectra for a comprehensive investigation. The interpretation of our model reveals that the peaks of most interest are similar to those often considered by spectroscopists. In addition to demonstrating great potential for spectral identification, our method may contribute to the development of automated analyses in many fields.


Assuntos
Aprendizado Profundo , Espectrofotometria Infravermelho
6.
Nucleic Acids Res ; 49(D1): D776-D782, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33125077

RESUMO

Xenobiotic and host active substances interact with gut microbiota to influence human health and therapeutics. Dietary, pharmaceutical, herbal and environmental substances are modified by microbiota with altered bioavailabilities, bioactivities and toxic effects. Xenobiotics also affect microbiota with health implications. Knowledge of these microbiota and active substance interactions is important for understanding microbiota-regulated functions and therapeutics. Established microbiota databases provide useful information about the microbiota-disease associations, diet and drug interventions, and microbiota modulation of drugs. However, there is insufficient information on the active substances modified by microbiota and the abundance of gut bacteria in humans. Only ∼7% drugs are covered by the established databases. To complement these databases, we developed MASI, Microbiota-Active Substance Interactions database, for providing the information about the microbiota alteration of various substances, substance alteration of microbiota, and the abundance of gut bacteria in humans. These include 1,051 pharmaceutical, 103 dietary, 119 herbal, 46 probiotic, 142 environmental substances interacting with 806 microbiota species linked to 56 diseases and 784 microbiota-disease associations. MASI covers 11 215 bacteria-pharmaceutical, 914 bacteria-herbal, 309 bacteria-dietary, 753 bacteria-environmental substance interactions and the abundance profiles of 259 bacteria species in 3465 patients and 5334 healthy individuals. MASI is freely accessible at http://www.aiddlab.com/MASI.


Assuntos
Bases de Dados como Assunto , Microbiota , Microbioma Gastrointestinal , Saúde , Humanos , Filogenia , Interface Usuário-Computador
7.
Brief Bioinform ; 21(6): 2206-2218, 2020 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-31799600

RESUMO

Protein dynamics is central to all biological processes, including signal transduction, cellular regulation and biological catalysis. Among them, in-depth exploration of ligand-driven protein dynamics contributes to an optimal understanding of protein function, which is particularly relevant to drug discovery. Hence, a wide range of computational tools have been designed to investigate the important dynamic information in proteins. However, performing and analyzing protein dynamics is still challenging due to the complicated operation steps, giving rise to great difficulty, especially for nonexperts. Moreover, there is a lack of web protocol to provide online facility to investigate and visualize ligand-driven protein dynamics. To this end, in this study, we integrated several bioinformatic tools to develop a protocol, named Ligand and Receptor Molecular Dynamics (LARMD, http://chemyang.ccnu.edu.cn/ccb/server/LARMD/ and http://agroda.gzu.edu.cn:9999/ccb/server/LARMD/), for profiling ligand-driven protein dynamics. To be specific, estrogen receptor (ER) was used as a case to reveal ERß-selective mechanism, which plays a vital role in the treatment of inflammatory diseases and many types of cancers in clinical practice. Two different residues (Ile373/Met421 and Met336/Leu384) in the pocket of ERß/ERα were the significant determinants for selectivity, especially Met336 of ERß. The helix H8, helix H11 and H7-H8 loop influenced the migration of selective agonist (WAY-244). These computational results were consistent with the experimental results. Therefore, LARMD provides a user-friendly online protocol to study the dynamic property of protein and to design new ligand or site-directed mutagenesis.


Assuntos
Biologia Computacional , Receptor alfa de Estrogênio , Receptor beta de Estrogênio , Simulação de Dinâmica Molecular , Biologia Computacional/métodos , Descoberta de Drogas , Receptor alfa de Estrogênio/química , Receptor alfa de Estrogênio/metabolismo , Receptor beta de Estrogênio/química , Receptor beta de Estrogênio/metabolismo , Ligantes
8.
Brief Bioinform ; 21(2): 649-662, 2020 03 23.
Artigo em Inglês | MEDLINE | ID: mdl-30689717

RESUMO

Drugs produce their therapeutic effects by modulating specific targets, and there are 89 innovative targets of first-in-class drugs approved in 2004-17, each with information about drug clinical trial dated back to 1984. Analysis of the clinical trial timelines of these targets may reveal the trial-speed differentiating features for facilitating target assessment. Here we present a comprehensive analysis of all these 89 targets, following the earlier studies for prospective prediction of clinical success of the targets of clinical trial drugs. Our analysis confirmed the literature-reported common druggability characteristics for clinical success of these innovative targets, exposed trial-speed differentiating features associated to the on-target and off-target collateral effects in humans and further revealed a simple rule for identifying the speedy human targets through clinical trials (from the earliest phase I to the 1st drug approval within 8 years). This simple rule correctly identified 75.0% of the 28 speedy human targets and only unexpectedly misclassified 13.2% of 53 non-speedy human targets. Certain extraordinary circumstances were also discovered to likely contribute to the misclassification of some human targets by this simple rule. Investigation and knowledge of trial-speed differentiating features enable prioritized drug discovery and development.


Assuntos
Ensaios Clínicos como Assunto , Aprovação de Drogas , Descoberta de Drogas , Humanos , Estudos de Tempo e Movimento
9.
Nucleic Acids Res ; 47(D1): D1118-D1127, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30357356

RESUMO

The beneficial effects of functionally useful plants (e.g. medicinal and food plants) arise from the multi-target activities of multiple ingredients of these plants. The knowledge of the collective molecular activities of these plants facilitates mechanistic studies and expanded applications. A number of databases provide information about the effects and targets of various plants and ingredients. More comprehensive information is needed for broader classes of plants and for the landscapes of individual plant's multiple targets, collective activities and regulated biological pathways, processes and diseases. We therefore developed a new database, Collective Molecular Activities of Useful Plants (CMAUP), to provide the collective landscapes of multiple targets (ChEMBL target classes) and activity levels (in 2D target-ingredient heatmap), and regulated gene ontologies (GO categories), biological pathways (KEGG categories) and diseases (ICD blocks) for 5645 plants (2567 medicinal, 170 food, 1567 edible, 3 agricultural and 119 garden plants) collected from or traditionally used in 153 countries and regions. These landscapes were derived from 47 645 plant ingredients active against 646 targets in 234 KEGG pathways associated with 2473 gene ontologies and 656 diseases. CMAUP (http://bidd2.nus.edu.sg/CMAUP/) is freely accessible and searchable by keywords, plant usage classes, species families, targets, KEGG pathways, gene ontologies, diseases (ICD code) and geographical locations.


Assuntos
Biologia Computacional/métodos , Produtos Agrícolas/química , Bases de Dados Factuais , Preparações de Plantas/uso terapêutico , Plantas Medicinais/química , Biologia Computacional/estatística & dados numéricos , Descoberta de Drogas/métodos , Armazenamento e Recuperação da Informação/métodos , Internet , Terapia de Alvo Molecular/métodos , Transdução de Sinais/efeitos dos fármacos , Interface Usuário-Computador
10.
Drug Dev Res ; 82(1): 133-142, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-32931039

RESUMO

Cancers resist targeted therapeutics by drug-escape signaling. Multitarget drugs co-targeting cancer and drug-escape mediators (DEMs) are clinically advantageous. DEM coverage may be expanded by drug combinations. This work evaluated to what extent the kinase DEMs (KDEMs) can be optimally co-targeted by drug combinations based on target promiscuities of individual drugs. We focused on 41 approved and 28 clinical trial small molecule kinase inhibitor drugs with available experimental kinome and clinical pharmacokinetic data. From the kinome inhibitory profiles of these drugs, drug combinations were assembled for optimally co-targeting an established cancer target (EGFR, HER2, ABL1, or MEK1) and 9-16 target-associated KDEMs at comparable potency levels as that against the cancer target. Each set of two-, three-, and four-drug combinations co-target 36-71%, 44-89%, 50-88%, and 27-55% KDEMs of EGFR, HER2, ABL1, and MEK1, respectively, compared with the 36, 33, 38, and 18% KDEMs maximally co-targeted by an existing drug or drug combination approved or clinically tested for the respective cancer. Some co-targeted KDEMs are not covered by any existing drug or drug combination. Our work suggested that novel drug combinations may be constructed for optimally co-targeting cancer and drug escape by the exploitation of drug target promiscuities.


Assuntos
Antineoplásicos/administração & dosagem , Inibidores de Proteínas Quinases/administração & dosagem , Antineoplásicos/farmacocinética , Combinação de Medicamentos , Sistemas de Liberação de Medicamentos , Resistencia a Medicamentos Antineoplásicos , Humanos , Neoplasias/tratamento farmacológico , Neoplasias/metabolismo , Inibidores de Proteínas Quinases/farmacocinética , Proteínas Quinases/metabolismo
11.
Nucleic Acids Res ; 46(D1): D1217-D1222, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29106619

RESUMO

There has been renewed interests in the exploration of natural products (NPs) for drug discovery, and continuous investigations of the therapeutic claims and mechanisms of traditional and herbal medicines. In-silico methods have been employed for facilitating these studies. These studies and the optimization of in-silico algorithms for NP applications can be facilitated by the quantitative activity and species source data of the NPs. A number of databases collectively provide the structural and other information of ∼470 000 NPs, including qualitative activity information for many NPs, but only ∼4000 NPs are with the experimental activity values. There is a need for the activity and species source data of more NPs. We therefore developed a new database, NPASS (Natural Product Activity and Species Source) to complement other databases by providing the experimental activity values and species sources of 35 032 NPs from 25 041 species targeting 5863 targets (2946 proteins, 1352 microbial species and 1227 cell-lines). NPASS contains 446 552 quantitative activity records (e.g. IC50, Ki, EC50, GI50 or MIC mainly in units of nM) of 222 092 NP-target pairs and 288 002 NP-species pairs. NPASS, http://bidd2.nus.edu.sg/NPASS/, is freely accessible with its contents searchable by keywords, physicochemical property range, structural similarity, species and target search facilities.


Assuntos
Produtos Biológicos/química , Produtos Biológicos/farmacologia , Bases de Dados Factuais , Animais , Coleta de Dados , Descoberta de Drogas/métodos , Internet , Interface Usuário-Computador , Navegador
12.
Nucleic Acids Res ; 46(D1): D1121-D1127, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29140520

RESUMO

Extensive efforts have been directed at the discovery, investigation and clinical monitoring of targeted therapeutics. These efforts may be facilitated by the convenient access of the genetic, proteomic, interactive and other aspects of the therapeutic targets. Here, we describe an update of the Therapeutic target database (TTD) previously featured in NAR. This update includes: (i) 2000 drug resistance mutations in 83 targets and 104 target/drug regulatory genes, which are resistant to 228 drugs targeting 63 diseases (49 targets of 61 drugs with patient prevalence data); (ii) differential expression profiles of 758 targets in the disease-relevant drug-targeted tissue of 12 615 patients of 70 diseases; (iii) expression profiles of 629 targets in the non-targeted tissues of 2565 healthy individuals; (iv) 1008 target combinations of 1764 drugs and the 1604 target combination of 664 multi-target drugs; (v) additional 48 successful, 398 clinical trial and 21 research targets, 473 approved, 812 clinical trial and 1120 experimental drugs, and (vi) ICD-10-CM and ICD-9-CM codes for additional 482 targets and 262 drugs against 98 disease conditions. This update makes TTD more useful for facilitating the patient focused research, discovery and clinical investigations of the targeted therapeutics. TTD is accessible at http://bidd.nus.edu.sg/group/ttd/ttd.asp.


Assuntos
Bases de Dados Factuais , Resistência a Medicamentos/genética , Tratamento Farmacológico , Expressão Gênica , Combinação de Medicamentos , Humanos , Internet , Terapia de Alvo Molecular , Mutação
13.
Brief Bioinform ; 18(6): 1057-1070, 2017 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-27542402

RESUMO

The genetic, proteomic, disease and pharmacological studies have generated rich data in protein interaction, disease regulation and drug activities useful for systems-level study of the biological, disease and drug therapeutic processes. These studies are facilitated by the established and the emerging computational methods. More recently, the network descriptors developed in other disciplines have become more increasingly used for studying the protein-protein, gene regulation, metabolic, disease networks. There is an inadequate coverage of these useful network features in the public web servers. We therefore introduced upto 313 literature-reported network descriptors in PROFEAT web server, for describing the topological, connectivity and complexity characteristics of undirected unweighted (uniform binding constants and molecular levels), undirected edge-weighted (varying binding constants), undirected node-weighted (varying molecular levels), undirected edge-node-weighted (varying binding constants and molecular levels) and directed unweighted (oriented process) networks. The usefulness of the PROFEAT computed network descriptors is illustrated by their literature-reported applications in studying the protein-protein, gene regulatory, gene co-expression, protein-drug and metabolic networks. PROFEAT is accessible free of charge at http://bidd2.nus.edu.sg/cgi-bin/profeat2016/main.cgi.


Assuntos
Biologia Computacional/métodos , Doença/classificação , Redes Reguladoras de Genes , Redes e Vias Metabólicas , Preparações Farmacêuticas , Mapeamento de Interação de Proteínas , Software , Algoritmos , Bases de Dados de Proteínas , Humanos , Internet , Biologia de Sistemas/métodos
14.
Drug Dev Res ; 80(2): 246-252, 2019 03.
Artigo em Inglês | MEDLINE | ID: mdl-30422335

RESUMO

The clinical advantage of co-targeting cancer drug escape has been indicated by the percentage of these co-targeting drugs among all multi-target drugs in clinics and clinical trials. This clinical advantage needs to be further interrogated from such perspectives as the clinical impact of multi-target inhibition of drug-escape mediators. This impact may be reflected by drug sales data, that is, multi-target inhibition of higher number of drug-escape mediators favors the expanded coverage of drug-resistant patients leading to higher sales. We investigated whether this expectation is followed by the 25 FDA-approved anticancer kinase inhibitors, which were divided into 11 groups of comparable therapeutic mechanisms and approval years. We found 19 (76%) drugs to follow and 3 (12%) drugs not to follow this expectation. The remaining two (8%) and one (4%) drugs cannot be assessed due to insufficient data and incomparability. Therefore, drug sales strongly indicate the clinical advantage of multi-target inhibition of cancer drug escapes.


Assuntos
Antineoplásicos/economia , Resistencia a Medicamentos Antineoplásicos , Terapia de Alvo Molecular , Neoplasias/economia , Inibidores de Proteínas Quinases/economia , Antineoplásicos/uso terapêutico , Comércio , Aprovação de Drogas , Humanos , Neoplasias/tratamento farmacológico , Inibidores de Proteínas Quinases/uso terapêutico , Resultado do Tratamento , Estados Unidos , United States Food and Drug Administration
15.
Molecules ; 24(20)2019 Oct 11.
Artigo em Inglês | MEDLINE | ID: mdl-31614517

RESUMO

Human A3 adenosine receptor hA3AR has been implicated in gastrointestinal cancer, where its cellular expression has been found increased, thus suggesting its potential as a molecular target for novel anticancer compounds. Observation made in our previous work indicated the importance of the carbonyl group of amide in the indolylpyrimidylpiperazine (IPP) for its human A2A adenosine receptor (hA2AAR) subtype binding selectivity over the other AR subtypes. Taking this observation into account, we structurally modified an indolylpyrimidylpiperazine (IPP) scaffold, 1 (a non-selective adenosine receptors' ligand) into a modified IPP (mIPP) scaffold by switching the position of the carbonyl group, resulting in the formation of both ketone and tertiary amine groups in the new scaffold. Results showed that such modification diminished the A2A activity and instead conferred hA3AR agonistic activity. Among the new mIPP derivatives (3-6), compound 4 showed potential as a hA3AR partial agonist, with an Emax of 30% and EC50 of 2.89 ± 0.55 µM. In the cytotoxicity assays, compound 4 also exhibited higher cytotoxicity against both colorectal and liver cancer cells as compared to normal cells. Overall, this new series of compounds provide a promising starting point for further development of potent and selective hA3AR partial agonists for the treatment of gastrointestinal cancers.


Assuntos
Neoplasias Gastrointestinais/tratamento farmacológico , Pirimidinonas/química , Receptor A2A de Adenosina/genética , Receptor A3 de Adenosina/genética , Antagonistas do Receptor A2 de Adenosina/síntese química , Antagonistas do Receptor A2 de Adenosina/química , Antagonistas do Receptor A2 de Adenosina/farmacologia , Animais , Células CHO , Proliferação de Células/efeitos dos fármacos , Cricetinae , Cricetulus , Neoplasias Gastrointestinais/genética , Neoplasias Gastrointestinais/patologia , Humanos , Indóis/síntese química , Indóis/química , Indóis/farmacologia , Modelos Moleculares , Piperazina/síntese química , Piperazina/química , Piperazina/farmacologia , Pirimidinonas/síntese química , Pirimidinonas/farmacologia , Receptor A2A de Adenosina/química , Relação Estrutura-Atividade
16.
Proteins ; 86(9): 978-989, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30051928

RESUMO

G-protein-coupled receptor (GPCR) is an important target class of proteins for drug discovery, with over 27% of FDA-approved drugs targeting GPCRs. However, being a membrane protein, it is difficult to obtain the 3D crystal structures of GPCRs for virtual screening of ligands by molecular docking. Thus, we evaluated the virtual screening performance of homology models of human GPCRs with respect to the corresponding crystal structures. Among the 19 GPCRs involved in this study, we observed that 10 GPCRs have homology models that have better or comparable performance with respect to the corresponding X-ray structures, making homology models a viable choice for virtual screening. For a small subset of GPCRs, we also explored how certain methods like consensus enrichment and sidechain perturbation affect the utility of homology models in virtual screening, as well as the selectivity between agonists and antagonists. Most notably, consensus enrichment across multiple homology models often yields results comparable to the best performing model, suggesting that ligand candidates predicted with consensus scores from multiple models can be the optimal option in practical applications where the performance of each model cannot be estimated.


Assuntos
Simulação de Acoplamento Molecular , Receptores Acoplados a Proteínas G/química , Sítios de Ligação , Cristalografia por Raios X , Bases de Dados de Compostos Químicos , Humanos , Ligantes , Ligação Proteica , Conformação Proteica , Receptores Acoplados a Proteínas G/agonistas , Receptores Acoplados a Proteínas G/antagonistas & inibidores , Homologia de Sequência de Aminoácidos , Relação Estrutura-Atividade , Termodinâmica
18.
Bioinformatics ; 33(20): 3276-3282, 2017 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-28549078

RESUMO

MOTIVATION: Genetic and gene expression variations within and between populations and across geographical regions have substantial effects on the biological phenotypes, diseases, and therapeutic response. The development of precision medicines can be facilitated by the OMICS studies of the patients of specific ethnicity and geographic region. However, there is an inadequate facility for broadly and conveniently accessing the ethnic and regional specific OMICS data. RESULTS: Here, we introduced a new free database, HEROD, a human ethnic and regional specific OMICS database. Its first version contains the gene expression data of 53 070 patients of 169 diseases in seven ethnic populations from 193 cities/regions in 49 nations curated from the Gene Expression Omnibus (GEO), the ArrayExpress Archive of Functional Genomics Data (ArrayExpress), the Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC). Geographic region information of curated patients was mainly manually extracted from referenced publications of each original study. These data can be accessed and downloaded via keyword search, World map search, and menu-bar search of disease name, the international classification of disease code, geographical region, location of sample collection, ethnic population, gender, age, sample source organ, patient type (patient or healthy), sample type (disease or normal tissue) and assay type on the web interface. AVAILABILITY AND IMPLEMENTATION: The HEROD database is freely accessible at http://bidd2.nus.edu.sg/herod/index.php. The database and web interface are implemented in MySQL, PHP and HTML with all major browsers supported. CONTACT: phacyz@nus.edu.sg.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Variação Genética , Genoma Humano , Grupos Populacionais/genética , Transcriptoma , Predisposição Genética para Doença , Humanos , Internet , Neoplasias/genética
19.
Nucleic Acids Res ; 44(D1): D1069-74, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26578601

RESUMO

Extensive drug discovery efforts have yielded many approved and candidate drugs targeting various targets in different biological pathways. Several freely accessible databases provide the drug, target and drug-targeted pathway information for facilitating drug discovery efforts, but there is an insufficient coverage of the clinical trial drugs and the drug-targeted pathways. Here, we describe an update of the Therapeutic Target Database (TTD) previously featured in NAR. The updated contents include: (i) significantly increased coverage of the clinical trial targets and drugs (1.6 and 2.3 times of the previous release, respectively), (ii) cross-links of most TTD target and drug entries to the corresponding pathway entries of KEGG, MetaCyc/BioCyc, NetPath, PANTHER pathway, Pathway Interaction Database (PID), PathWhiz, Reactome and WikiPathways, (iii) the convenient access of the multiple targets and drugs cross-linked to each of these pathway entries and (iv) the recently emerged approved and investigative drugs. This update makes TTD a more useful resource to complement other databases for facilitating the drug discovery efforts. TTD is accessible at http://bidd.nus.edu.sg/group/ttd/ttd.asp.


Assuntos
Bases de Dados de Produtos Farmacêuticos , Descoberta de Drogas , Ensaios Clínicos como Assunto , Internet , Transdução de Sinais/efeitos dos fármacos
20.
Int J Mol Sci ; 19(1)2018 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-29316706

RESUMO

The function of a protein is of great interest in the cutting-edge research of biological mechanisms, disease development and drug/target discovery. Besides experimental explorations, a variety of computational methods have been designed to predict protein function. Among these in silico methods, the prediction of BLAST is based on protein sequence similarity, while that of machine learning is also based on the sequence, but without the consideration of their similarity. This unique characteristic of machine learning makes it a good complement to BLAST and many other approaches in predicting the function of remotely relevant proteins and the homologous proteins of distinct function. However, the identification accuracies of these in silico methods and their false discovery rate have not yet been assessed so far, which greatly limits the usage of these algorithms. Herein, a comprehensive comparison of the performances among four popular prediction algorithms (BLAST, SVM, PNN and KNN) was conducted. In particular, the performance of these methods was systematically assessed by four standard statistical indexes based on the independent test datasets of 93 functional protein families defined by UniProtKB keywords. Moreover, the false discovery rates of these algorithms were evaluated by scanning the genomes of four representative model organisms (Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae and Mycobacterium tuberculosis). As a result, the substantially higher sensitivity of SVM and BLAST was observed compared with that of PNN and KNN. However, the machine learning algorithms (PNN, KNN and SVM) were found capable of substantially reducing the false discovery rate (SVM < PNN < KNN). In sum, this study comprehensively assessed the performance of four popular algorithms applied to protein function prediction, which could facilitate the selection of the most appropriate method in the related biomedical research.


Assuntos
Análise de Sequência de Proteína/normas , Software , Aprendizado de Máquina , Proteômica/métodos , Proteômica/normas , Reprodutibilidade dos Testes , Análise de Sequência de Proteína/métodos
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa