Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
Nucleic Acids Res ; 52(D1): D552-D561, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37819028

RESUMO

Single-cell proteomics (SCP) has emerged as a powerful tool for detecting cellular heterogeneity, offering unprecedented insights into biological mechanisms that are masked in bulk cell populations. With the rapid advancements in AI-based time trajectory analysis and cell subpopulation identification, there exists a pressing need for a database that not only provides SCP raw data but also explicitly describes experimental details and protein expression profiles. However, no such database has been available yet. In this study, a database, entitled 'SingPro', specializing in single-cell proteomics was thus developed. It was unique in (a) systematically providing the SCP raw data for both mass spectrometry-based and flow cytometry-based studies and (b) explicitly describing experimental detail for SCP study and expression profile of any studied protein. Anticipating a robust interest from the research community, this database is poised to become an invaluable repository for OMICs-based biomedical studies. Access to SingPro is unrestricted and does not mandate a login at: http://idrblab.org/singpro/.


Assuntos
Bases de Dados de Proteínas , Processamento de Proteína Pós-Traducional , Proteômica , Bases de Conhecimento , Espectrometria de Massas , Análise de Célula Única
2.
Nucleic Acids Res ; 2024 Oct 07.
Artigo em Inglês | MEDLINE | ID: mdl-39373530

RESUMO

The measurement of cell-based molecular bioactivity (CMB) is critical for almost every step of drug development. With the booming application of AI in biomedicine, it is essential to have the CMB data to promote the learning of cell-based patterns for guiding modern drug discovery, but no database providing such information has been constructed yet. In this study, we introduce MolBiC, a knowledge base designed to describe valuable data on molecular bioactivity measured within a cellular context. MolBiC features 550 093 experimentally validated CMBs, encompassing 321 086 molecules and 2666 targets across 988 cell lines. Our MolBiC database is unique in describing the valuable data of CMB, which meets the critical demands for CMB-based big data promoting the learning of cell-based molecular/pharmaceutical pattern in drug discovery and development. MolBiC is now freely accessible without any login requirement at: https://idrblab.org/MolBiC/.

3.
Nucleic Acids Res ; 2024 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-39360619

RESUMO

Peptide-drug conjugates (PDCs) have emerged as a promising class of targeted therapeutics with substantial pharmaceutical advantages and market potentials, which is a combination of a peptide (selective to the disease-relevant target), a linker (stable in circulation but cleavable at target site) and a cytotoxic/radioactive drug (efficacious/traceable for disease). Among existing PDCs, those based on radiopharmaceuticals (a.k.a. radioactive drugs) are valued due to their accurate imaging and targeted destruction of disease sites. It's demanded to accumulate the biological activity and pharmaceutical information of PDCs. Herein, a database PDCdb was thus constructed to systematically describe these valuable data. Particularly, biological activities for 2036 PDCs were retrieved from literatures, which resulted in 1684, 613 and 2753 activity data generated based on clinical trial, animal model and cell line, respectively. Furthermore, the pharmaceutical information for all 2036 PDCs was collected, which gave the diverse data of (a) ADME property, plasma half-life and administration approach of a PDC and (b) chemical modification, primary target, mode of action, conjugating feature of the constituent peptide/linker/drug. In sum, PDCdb systematically provided the biological activities and pharmaceutical information for the most comprehensive list of PDCs among the available databases, which was expected to attract broad interest from related communities and could be freely accessible at: https://idrblab.org/PDCdb/.

4.
Nucleic Acids Res ; 51(D1): D546-D556, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36200814

RESUMO

Coronavirus has brought about three massive outbreaks in the past two decades. Each step of its life cycle invariably depends on the interactions among virus and host molecules. The interaction between virus RNA and host protein (IVRHP) is unique compared to other virus-host molecular interactions and represents not only an attempt by viruses to promote their translation/replication, but also the host's endeavor to combat viral pathogenicity. In other words, there is an urgent need to develop a database for providing such IVRHP data. In this study, a new database was therefore constructed to describe the interactions between coronavirus RNAs and host proteins (CovInter). This database is unique in (a) unambiguously characterizing the interactions between virus RNA and host protein, (b) comprehensively providing experimentally validated biological function for hundreds of host proteins key in viral infection and (c) systematically quantifying the differential expression patterns (before and after infection) of these key proteins. Given the devastating and persistent threat of coronaviruses, CovInter is highly expected to fill the gap in the whole process of the 'molecular arms race' between viruses and their hosts, which will then aid in the discovery of new antiviral therapies. It's now free and publicly accessible at: https://idrblab.org/covinter/.


Assuntos
Coronavirus , Interações Hospedeiro-Patógeno , RNA Viral , Humanos , Coronavirus/genética , Coronavirus/metabolismo , Infecções por Coronavirus/metabolismo , Interações Hospedeiro-Patógeno/genética , RNA Viral/genética , RNA Viral/metabolismo , Replicação Viral , Bases de Dados Genéticas
5.
Nucleic Acids Res ; 45(D1): D200-D203, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899674

RESUMO

NCBI's Conserved Domain Database (CDD) aims at annotating biomolecular sequences with the location of evolutionarily conserved protein domain footprints, and functional sites inferred from such footprints. An archive of pre-computed domain annotation is maintained for proteins tracked by NCBI's Entrez database, and live search services are offered as well. CDD curation staff supplements a comprehensive collection of protein domain and protein family models, which have been imported from external providers, with representations of selected domain families that are curated in-house and organized into hierarchical classifications of functionally distinct families and sub-families. CDD also supports comparative analyses of protein families via conserved domain architectures, and a recent curation effort focuses on providing functional characterizations of distinct subfamily architectures using SPARCLE: Subfamily Protein Architecture Labeling Engine. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Domínios e Motivos de Interação entre Proteínas , Proteínas , Disseminação de Informação , Internet , Proteínas/química , Proteínas/classificação , Proteínas/genética
6.
Nucleic Acids Res ; 44(D1): D1202-13, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26400175

RESUMO

PubChem (https://pubchem.ncbi.nlm.nih.gov) is a public repository for information on chemical substances and their biological activities, launched in 2004 as a component of the Molecular Libraries Roadmap Initiatives of the US National Institutes of Health (NIH). For the past 11 years, PubChem has grown to a sizable system, serving as a chemical information resource for the scientific research community. PubChem consists of three inter-linked databases, Substance, Compound and BioAssay. The Substance database contains chemical information deposited by individual data contributors to PubChem, and the Compound database stores unique chemical structures extracted from the Substance database. Biological activity data of chemical substances tested in assay experiments are contained in the BioAssay database. This paper provides an overview of the PubChem Substance and Compound databases, including data sources and contents, data organization, data submission using PubChem Upload, chemical structure standardization, web-based interfaces for textual and non-textual searches, and programmatic access. It also gives a brief description of PubChem3D, a resource derived from theoretical three-dimensional structures of compounds in PubChem, as well as PubChemRDF, Resource Description Framework (RDF)-formatted PubChem data for data sharing, analysis and integration with information contained in other databases.


Assuntos
Bases de Dados de Compostos Químicos , Internet , Estrutura Molecular , Preparações Farmacêuticas/química , Software
7.
Nucleic Acids Res ; 40(Database issue): D400-12, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22140110

RESUMO

PubChem (http://pubchem.ncbi.nlm.nih.gov) is a public repository for biological activity data of small molecules and RNAi reagents. The mission of PubChem is to deliver free and easy access to all deposited data, and to provide intuitive data analysis tools. The PubChem BioAssay database currently contains 500,000 descriptions of assay protocols, covering 5000 protein targets, 30,000 gene targets and providing over 130 million bioactivity outcomes. PubChem's bioassay data are integrated into the NCBI Entrez information retrieval system, thus making PubChem data searchable and accessible by Entrez queries. Also, as a repository, PubChem constantly optimizes and develops its deposition system answering many demands of both high- and low-volume depositors. The PubChem information platform allows users to search, review and download bioassay description and data. The PubChem platform also enables researchers to collect, compare and analyze biological test results through web-based and programmatic tools. In this work, we provide an update for the PubChem BioAssay resource, including information content growth, data model extension and new developments of data submission, retrieval, analysis and download tools.


Assuntos
Bases de Dados Factuais , Descoberta de Drogas , Interferência de RNA , Bioensaio , Ensaios de Triagem em Larga Escala , Indicadores e Reagentes , Estrutura Molecular , Software
8.
Artigo em Inglês | MEDLINE | ID: mdl-39423074

RESUMO

This study introduces an innovative deep-learning model for cuffless blood pressure estimation using PPG and ECG signals, demonstrating state-of-the-art performance on the largest clean dataset, PulseDB. The rU-Net architecture, a fusion of U-Net and ResNet, enhances both generalization and feature extraction accuracy. Accurate multi-scale feature capture is facilitated by short-time Fourier transform (STFT) time-frequency distributions and multi-head attention mechanisms, allowing data-driven feature selection. The inclusion of demographic parameters as supervisory information further elevates performance. On the calibration-based dataset, our model excels, achieving outstanding accuracy (SBP MAE ± std: 4.49 ± 4.86 mmHg, DBP MAE ± std: 2.69 ± 3.10 mmHg), surpassing AAMI standards and earning a BHS Grade A rating. Addressing the challenge of calibration-free data, we propose a fine-tuning-based transfer learning approach. Remarkably, with only 10% data transfer, our model attains exceptional accuracy (SBP MAE ± std: 4.14 ± 5.01 mmHg, DBP MAE ± std: 2.48 ± 2.93 mmHg). This study sets the stage for the development of highly accurate and reliable wearable cuffless blood pressure monitoring devices.

9.
Comput Biol Med ; 169: 107886, 2024 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-38157777

RESUMO

RNA viruses are major human pathogens that cause seasonal epidemics and occasional pandemic outbreaks. Due to the nature of their RNA genomes, it is anticipated that virus's RNA interacts with host protein (INTPRO), messenger RNA (INTmRNA), and non-coding RNA (INTncRNA) to perform their particular functions during their transcription and replication. In other words, thus, it is urgently needed to have such valuable data on virus RNA-directed molecular interactions (especially INTPROs), which are highly anticipated to attract broad research interests in the fields of RNA virus translation and replication. In this study, a new database was constructed to describe the virus RNA-directed interaction (INTPRO, INTmRNA, INTncRNA) for RNA virus (RVvictor). This database is unique in a) unambiguously characterizing the interactions between viruses RNAs and host proteins, b) providing, for the first time, the most systematic RNA-directed interaction data resources in providing clues to understand the molecular mechanisms of RNA viruses' translation, and replication, and c) in RVvictor, comprehensive enrichment analysis is conducted for each virus RNA based on its associated target genes/proteins, and the enrichment results were explicitly illustrated using various graphs. We found significant enrichment of a suite of pathways related to infection, translation, and replication, e.g., HIV infection, coronavirus disease, regulation of viral genome replication, and so on. Due to the devastating and persistent threat posed by the RNA virus, RVvictor constructed, for the first time, a possible network of cross-talk in RNA-directed interaction, which may ultimately explain the pathogenicity of RNA virus infection. The knowledge base might help develop new anti-viral therapeutic targets in the future. It's now free and publicly accessible at: https://idrblab.org/rvvictor/.


Assuntos
Infecções por HIV , Vírus de RNA , Humanos , RNA Viral/genética , RNA Viral/metabolismo , Vírus de RNA/genética , Vírus de RNA/metabolismo , Replicação Viral/genética , Regulação da Expressão Gênica
10.
J Invest Dermatol ; 2024 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-39306031

RESUMO

Characterizing the metabolite fingerprint from the skin surface provides invaluable insights into skin biology and microbe-host interactions. To ensure data accuracy and reproducibility, it is essential to develop standard operating procedures for skin surface metabolomics. However, there is a notable lack of studies in this area. In this study, we thoroughly evaluated different sampling materials, extraction solvents, taping methods (frequency and number of tapes), and analytical techniques to optimize skin surface metabolomics. Our results showed that the combination of D-Squame D100 tape with a methyl tert-butyl ether/methanol extractant is optimal for skin surface lipidomics. Performing the skin-taping procedure 5 times with 1 tape yields sufficient biomass for lipid analysis, whereas the optimal taping procedure varies for water-soluble compounds. In addition, our study identified associations among the skin surface metabolites, some of which potentially underlie the formation of microbial cutotypes and offer insights into host-microbe interactions.

11.
Clin Transl Allergy ; 14(1): e12328, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38282190

RESUMO

BACKGROUND: Chronic spontaneous urticaria (CSU) is unpredictable and can severely impair patients' quality of life. Patients with CSU need a convenient, user-friendly platform to complete patient-reported outcome measures (PROMs) on their mobile devices. CRUSE® , the Chronic Urticaria Self Evaluation app, aims to address this unmet need. METHODS: CRUSE® was developed by an international steering committee of urticaria specialists. Priorities for the app based on recent findings in CSU were defined to allow patients to track and record their symptoms and medication use over time and send photographs. The CRUSE® app collects patient data such as age, sex, disease onset, triggers, medication, and CSU characteristics that can be sent securely to physicians, providing real-time insights. Additionally, CRUSE® contains PROMs to assess disease activity and control, which are individualised to patient profiles and clinical manifestations. RESULTS: CRUSE® was launched in Germany in March 2022 and is now available for free in 17 countries. It is adapted to the local language and displays a country-specific list of available urticaria medications. English and Ukrainian versions are available worldwide. From July 2022 to June 2023, 25,710 observations were documented by 2540 users; 72.7% were females, with a mean age of 39.6 years. At baseline, 93.7% and 51.3% of users had wheals and angioedema, respectively. Second-generation antihistamines were used in 74.0% of days. CONCLUSIONS: The initial data from CRUSE® show the wide use and utility of effectively tracking patients' disease activity and control, paving the way for personalised CSU management.

12.
Nat Commun ; 14(1): 6384, 2023 10 11.
Artigo em Inglês | MEDLINE | ID: mdl-37821436

RESUMO

Currently potential preclinical drugs for the treatment of nonalcoholic steatohepatitis (NASH) and NASH-related pathopoiesis have failed to achieve expected therapeutic efficacy due to the complexity of the pathogenic mechanisms. Here we show Tripartite motif containing 26 (TRIM26) as a critical endogenous suppressor of CCAAT/enhancer binding protein delta (C/EBPδ), and we also confirm that TRIM26 is an C/EBPδ-interacting partner protein that catalyses the ubiquitination degradation of C/EBPδ in hepatocytes. Hepatocyte-specific loss of Trim26 disrupts liver metabolic homeostasis, followed by glucose metabolic disorder, lipid accumulation, increased hepatic inflammation, and fibrosis, and dramatically facilitates NASH-related phenotype progression. Inversely, transgenic Trim26 overexpression attenuates the NASH-associated phenotype in a rodent or rabbit model. We provide mechanistic evidence that, in response to metabolic insults, TRIM26 directly interacts with C/EBPδ and promotes its ubiquitin proteasome degradation. Taken together, our present findings identify TRIM26 as a key suppressor over the course of NASH development.


Assuntos
Hepatopatia Gordurosa não Alcoólica , Animais , Coelhos , Hepatopatia Gordurosa não Alcoólica/genética , Hepatopatia Gordurosa não Alcoólica/prevenção & controle , Transdução de Sinais , Ubiquitinação , Ubiquitina-Proteína Ligases/genética , Ubiquitina-Proteína Ligases/metabolismo
13.
Nucleic Acids Res ; 38(Database issue): D492-6, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19854944

RESUMO

The NCBI BioSystems database, found at http://www.ncbi.nlm.nih.gov/biosystems/, centralizes and cross-links existing biological systems databases, increasing their utility and target audience by integrating their pathways and systems into NCBI resources. This integration allows users of NCBI's Entrez databases to quickly categorize proteins, genes and small molecules by metabolic pathway, disease state or other BioSystem type, without requiring time-consuming inference of biological relationships from the literature or multiple experimental datasets.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Bases de Dados de Ácidos Nucleicos , Biologia de Sistemas , Animais , Membrana Celular/metabolismo , Biologia Computacional/tendências , Bases de Dados de Proteínas , Genes , Genômica , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , National Library of Medicine (U.S.) , Software , Estados Unidos
14.
Nucleic Acids Res ; 38(Database issue): D787-91, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19933260

RESUMO

Increasing numbers of proteins, nucleic acids and other molecular entities have been explored as therapeutic targets, hundreds of which are targets of approved and clinical trial drugs. Knowledge of these targets and corresponding drugs, particularly those in clinical uses and trials, is highly useful for facilitating drug discovery. Therapeutic Target Database (TTD) has been developed to provide information about therapeutic targets and corresponding drugs. In order to accommodate increasing demand for comprehensive knowledge about the primary targets of the approved, clinical trial and experimental drugs, numerous improvements and updates have been made to TTD. These updates include information about 348 successful, 292 clinical trial and 1254 research targets, 1514 approved, 1212 clinical trial and 2302 experimental drugs linked to their primary targets (3382 small molecule and 649 antisense drugs with available structure and sequence), new ways to access data by drug mode of action, recursive search of related targets or drugs, similarity target and drug searching, customized and whole data download, standardized target ID, and significant increase of data (1894 targets, 560 diseases and 5028 drugs compared with the 433 targets, 125 diseases and 809 drugs in the original release described in previous paper). This database can be accessed at http://bidd.nus.edu.sg/group/cjttd/TTD.asp.


Assuntos
Biologia Computacional/métodos , Bases de Dados Factuais , Bases de Dados Genéticas , Preparações Farmacêuticas/química , Química Farmacêutica/métodos , Ensaios Clínicos como Assunto , Biologia Computacional/tendências , Bases de Dados de Proteínas , Aprovação de Drogas , Tratamento Farmacológico/métodos , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , Software , Estados Unidos , United States Food and Drug Administration
15.
Front Public Health ; 10: 1042618, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36438265

RESUMO

Background: As an emerging technology, virtual reality (VR) has been broadly applied in the medical field, especially in neurorehabilitation. The growing application of VR therapy promotes an increasing amount of clinical studies. In this paper, we present a bibliometric analysis of the existing studies to reveal the current research hotspots and guide future research directions. Methods: Articles and reviews on the related topic were retrieved from the Science Citation Index Expanded of Web of Science Core Collection database. VOSviewer and Citespace software were applied to systematically analyze information about publications, countries, institutions, authors, journals, citations, and keywords from the included studies. Results: A total of 1,556 papers published between 1995 and 2021 were identified. The annual number of papers increased gradually over the past three decades, with a peak publication year in 2021 (n = 276). Countries and institutions from North America and Western European were playing leading roles in publications and total citations. Current hotspots were focused on the effectiveness of VR therapy in cognitive and upper limb motor rehabilitation. The clusters of keywords contained the four targeted neurological diseases of VR, while the burst keywords represented that the latest studies were directed toward more defined types of VR therapy and greater study design. Conclusions: Our study offers information regarding to the current hotspots and emerging trends in the VR for rehabilitation field. It could guide future research and application of VR therapy in neurorehabilitation.


Assuntos
Reabilitação Neurológica , Realidade Virtual , Humanos , Bibliometria , América do Norte
16.
IEEE Trans Med Imaging ; 40(9): 2403-2414, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-33945472

RESUMO

Instance segmentation is of great importance for many biological applications, such as study of neural cell interactions, plant phenotyping, and quantitatively measuring how cells react to drug treatment. In this paper, we propose a novel box-based instance segmentation method. Box-based instance segmentation methods capture objects via bounding boxes and then perform individual segmentation within each bounding box region. However, existing methods can hardly differentiate the target from its neighboring objects within the same bounding box region due to their similar textures and low-contrast boundaries. To deal with this problem, in this paper, we propose an object-guided instance segmentation method. Our method first detects the center points of the objects, from which the bounding box parameters are then predicted. To perform segmentation, an object-guided coarse-to-fine segmentation branch is built along with the detection branch. The segmentation branch reuses the object features as guidance to separate target object from the neighboring ones within the same bounding box region. To further improve the segmentation quality, we design an auxiliary feature refinement module that densely samples and refines point-wise features in the boundary regions. Experimental results on three biological image datasets demonstrate the advantages of our method. The code will be available at https://github.com/yijingru/ObjGuided-Instance-Segmentation.


Assuntos
Interpretação de Imagem Assistida por Computador , Processamento de Imagem Assistida por Computador , Aprendizado de Máquina , Redes Neurais de Computação
17.
BMC Bioinformatics ; 11: 549, 2010 Nov 08.
Artigo em Inglês | MEDLINE | ID: mdl-21059237

RESUMO

BACKGROUND: In recent years, the number of High Throughput Screening (HTS) assays deposited in PubChem has grown quickly. As a result, the volume of both the structured information (i.e. molecular structure, bioactivities) and the unstructured information (such as descriptions of bioassay experiments), has been increasing exponentially. As a result, it has become even more demanding and challenging to efficiently assemble the bioactivity data by mining the huge amount of information to identify and interpret the relationships among the diversified bioassay experiments. In this work, we propose a text-mining based approach for bioassay neighboring analysis from the unstructured text descriptions contained in the PubChem BioAssay database. RESULTS: The neighboring analysis is achieved by evaluating the cosine scores of each bioassay pair and fraction of overlaps among the human-curated neighbors. Our results from the cosine score distribution analysis and assay neighbor clustering analysis on all PubChem bioassays suggest that strong correlations among the bioassays can be identified from their conceptual relevance. A comparison with other existing assay neighboring methods suggests that the text-mining based bioassay neighboring approach provides meaningful linkages among the PubChem bioassays, and complements the existing methods by identifying additional relationships among the bioassay entries. CONCLUSIONS: The text-mining based bioassay neighboring analysis is efficient for correlating bioassays and studying different aspects of a biological process, which are otherwise difficult to achieve by existing neighboring procedures due to the lack of specific annotations and structured information. It is suggested that the text-mining based bioassay neighboring analysis can be used as a standalone or as a complementary tool for the PubChem bioassay neighboring process to enable efficient integration of assay results and generate hypotheses for the discovery of bioactivities of the tested reagents.


Assuntos
Mineração de Dados/métodos , Ensaios de Triagem em Larga Escala , Bioensaio , Bases de Dados Factuais
18.
Bioinformatics ; 25(17): 2251-5, 2009 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-19549631

RESUMO

This work provides an analysis of across-target bioactivity results in the screening data deposited in PubChem. Two alternative approaches for grouping-related targets are used to examine a compound's across-target bioactivity. This analysis identifies compounds that are selectively active against groups of protein targets that are identical or similar in sequence. This analysis also identifies compounds that are bioactive across unrelated targets. Statistical distributions of compound' across-target selectivity provide a survey to evaluate target specificity of compounds by deriving and analyzing bioactivity profile across a wide range of biological targets for tested small molecules in PubChem. This work enables one to select target specific inhibitors, identify promiscuous compounds and better understand the biological mechanisms of target-small molecule interactions.


Assuntos
Coleta de Dados , Bases de Dados Factuais , Avaliação Pré-Clínica de Medicamentos , Bibliotecas de Moléculas Pequenas/farmacologia , Análise por Conglomerados
19.
J Pharmacol Exp Ther ; 330(1): 304-15, 2009 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-19357322

RESUMO

Low target discovery rate has been linked to inadequate consideration of multiple factors that collectively contribute to druggability. These factors include sequence, structural, physicochemical, and systems profiles. Methods individually exploring each of these profiles for target identification have been developed, but they have not been collectively used. We evaluated the collective capability of these methods in identifying promising targets from 1019 research targets based on the multiple profiles of up to 348 successful targets. The collective method combining at least three profiles identified 50, 25, 10, and 4% of the 30, 84, 41, and 864 phase III, II, I, and nonclinical trial targets as promising, including eight to nine targets of positive phase III results. This method dropped 89% of the 19 discontinued clinical trial targets and 97% of the 65 targets failed in high-throughput screening or knockout studies. Collective consideration of multiple profiles demonstrated promising potential in identifying innovative targets.


Assuntos
Fenômenos Químicos/efeitos dos fármacos , Sistemas de Liberação de Medicamentos/métodos , Descoberta de Drogas/métodos , Marcação de Genes/métodos , Animais , Ensaios Clínicos como Assunto/métodos , Ensaios Clínicos como Assunto/tendências , Sistemas de Liberação de Medicamentos/tendências , Desenho de Fármacos , Descoberta de Drogas/tendências , Marcação de Genes/tendências , Humanos , Relação Estrutura-Atividade
20.
BMC Bioinformatics ; 9: 401, 2008 Sep 25.
Artigo em Inglês | MEDLINE | ID: mdl-18817552

RESUMO

BACKGROUND: Recent advances in high-throughput screening (HTS) techniques and readily available compound libraries generated using combinatorial chemistry or derived from natural products enable the testing of millions of compounds in a matter of days. Due to the amount of information produced by HTS assays, it is a very challenging task to mine the HTS data for potential interest in drug development research. Computational approaches for the analysis of HTS results face great challenges due to the large quantity of information and significant amounts of erroneous data produced. RESULTS: In this study, Decision Trees (DT) based models were developed to discriminate compound bioactivities by using their chemical structure fingerprints provided in the PubChem system http://pubchem.ncbi.nlm.nih.gov. The DT models were examined for filtering biological activity data contained in four assays deposited in the PubChem Bioassay Database including assays tested for 5HT1a agonists, antagonists, and HIV-1 RT-RNase H inhibitors. The 10-fold Cross Validation (CV) sensitivity, specificity and Matthews Correlation Coefficient (MCC) for the models are 57.2 approximately 80.5%, 97.3 approximately 99.0%, 0.4 approximately 0.5 respectively. A further evaluation was also performed for DT models built for two independent bioassays, where inhibitors for the same HIV RNase target were screened using different compound libraries, this experiment yields enrichment factor of 4.4 and 9.7. CONCLUSION: Our results suggest that the designed DT models can be used as a virtual screening technique as well as a complement to traditional approaches for hits selection.


Assuntos
Bioensaio/métodos , Bioensaio/estatística & dados numéricos , Árvores de Decisões , Armazenamento e Recuperação da Informação/métodos , Reconhecimento Automatizado de Padrão/métodos , Relação Quantitativa Estrutura-Atividade , Inteligência Artificial , Técnicas de Apoio para a Decisão , Descoberta de Drogas/métodos , Ribonuclease H do Vírus da Imunodeficiência Humana/antagonistas & inibidores , Sensibilidade e Especificidade , Proteínas da Membrana Plasmática de Transporte de Serotonina/agonistas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA