Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 55
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
BMC Bioinformatics ; 25(1): 59, 2024 Feb 06.
Artículo en Inglés | MEDLINE | ID: mdl-38321386

RESUMEN

The prediction of interactions between novel drugs and biological targets is a vital step in the early stage of the drug discovery pipeline. Many deep learning approaches have been proposed over the last decade, with a substantial fraction of them sharing the same underlying two-branch architecture. Their distinction is limited to the use of different types of feature representations and branches (multi-layer perceptrons, convolutional neural networks, graph neural networks and transformers). In contrast, the strategy used to combine the outputs (embeddings) of the branches has remained mostly the same. The same general architecture has also been used extensively in the area of recommender systems, where the choice of an aggregation strategy is still an open question. In this work, we investigate the effectiveness of three different embedding aggregation strategies in the area of drug-target interaction (DTI) prediction. We formally define these strategies and prove their universal approximator capabilities. We then present experiments that compare the different strategies on benchmark datasets from the area of DTI prediction, showcasing conditions under which specific strategies could be the obvious choice.


Asunto(s)
Benchmarking , Descubrimiento de Drogas , Suministros de Energía Eléctrica , Redes Neurales de la Computación
2.
Brief Bioinform ; 21(1): 262-271, 2020 Jan 17.
Artículo en Inglés | MEDLINE | ID: mdl-30329015

RESUMEN

Supervised machine learning techniques have traditionally been very successful at reconstructing biological networks, such as protein-ligand interaction, protein-protein interaction and gene regulatory networks. Many supervised techniques for network prediction use linear models on a possibly nonlinear pairwise feature representation of edges. Recently, much emphasis has been placed on the correct evaluation of such supervised models. It is vital to distinguish between using a model to either predict new interactions in a given network or to predict interactions for a new vertex not present in the original network. This distinction matters because (i) the performance might dramatically differ between the prediction settings and (ii) tuning the model hyperparameters to obtain the best possible model depends on the setting of interest. Specific cross-validation schemes need to be used to assess the performance in such different prediction settings. In this work we discuss a state-of-the-art kernel-based network inference technique called two-step kernel ridge regression. We show that this regression model can be trained efficiently, with a time complexity scaling with the number of vertices rather than the number of edges. Furthermore, this framework leads to a series of cross-validation shortcuts that allow one to rapidly estimate the model performance for any relevant network prediction setting. This allows computational biologists to fully assess the capabilities of their models. The machine learning techniques with the algebraic shortcuts are implemented in the RLScore software package: https://github.com/aatapa/RLScore.

3.
Bioinformatics ; 37(Suppl_1): i93-i101, 2021 07 12.
Artículo en Inglés | MEDLINE | ID: mdl-34252952

RESUMEN

MOTIVATION: Combination therapies have emerged as a powerful treatment modality to overcome drug resistance and improve treatment efficacy. However, the number of possible drug combinations increases very rapidly with the number of individual drugs in consideration, which makes the comprehensive experimental screening infeasible in practice. Machine-learning models offer time- and cost-efficient means to aid this process by prioritizing the most effective drug combinations for further pre-clinical and clinical validation. However, the complexity of the underlying interaction patterns across multiple drug doses and in different cellular contexts poses challenges to the predictive modeling of drug combination effects. RESULTS: We introduce comboLTR, highly time-efficient method for learning complex, non-linear target functions for describing the responses of therapeutic agent combinations in various doses and cancer cell-contexts. The method is based on a polynomial regression via powerful latent tensor reconstruction. It uses a combination of recommender system-style features indexing the data tensor of response values in different contexts, and chemical and multi-omics features as inputs. We demonstrate that comboLTR outperforms state-of-the-art methods in terms of predictive performance and running time, and produces highly accurate results even in the challenging and practical inference scenario where full dose-response matrices are predicted for completely new drug combinations with no available combination and monotherapy response measurements in any training cell line. AVAILABILITY AND IMPLEMENTATION: comboLTR code is available at https://github.com/aalto-ics-kepaco/ComboLTR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Biología Computacional , Neoplasias , Algoritmos , Línea Celular , Combinación de Medicamentos , Humanos
4.
J Magn Reson Imaging ; 55(2): 465-477, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-34227169

RESUMEN

BACKGROUND: Accurate detection of clinically significant prostate cancer (csPCa), Gleason Grade Group ≥ 2, remains a challenge. Prostate MRI radiomics and blood kallikreins have been proposed as tools to improve the performance of biparametric MRI (bpMRI). PURPOSE: To develop and validate radiomics and kallikrein models for the detection of csPCa. STUDY TYPE: Retrospective. POPULATION: A total of 543 men with a clinical suspicion of csPCa, 411 (76%, 411/543) had kallikreins available and 360 (88%, 360/411) did not take 5-alpha-reductase inhibitors. Two data splits into training, validation (split 1: single center, n = 72; split 2: random 50% of pooled datasets from all four centers), and testing (split 1: 4 centers, n = 288; split 2: remaining 50%) were evaluated. FIELD STRENGTH/SEQUENCE: A 3 T/1.5 T, TSE T2-weighted imaging, 3x SE DWI. ASSESSMENT: In total, 20,363 radiomic features calculated from manually delineated whole gland (WG) and bpMRI suspicion lesion masks were evaluated in addition to clinical parameters, prostate-specific antigen, four kallikreins, MRI-based qualitative (PI-RADSv2.1/IMPROD bpMRI Likert) scores. STATISTICAL TESTS: For the detection of csPCa, area under receiver operating curve (AUC) was calculated using the DeLong's method. A multivariate analysis was conducted to determine the predictive power of combining variables. The values of P-value < 0.05 were considered significant. RESULTS: The highest prediction performance was achieved by IMPROD bpMRI Likert and PI-RADSv2.1 score with AUC = 0.85 and 0.85 in split 1, 0.85 and 0.83 in split 2, respectively. bpMRI WG and/or kallikreins demonstrated AUCs ranging from 0.62 to 0.73 in split 1 and from 0.68 to 0.76 in split 2. AUC of bpMRI lesion-derived radiomics model was not statistically different to IMPROD bpMRI Likert score (split 1: AUC = 0.83, P-value = 0.306; split 2: AUC = 0.83, P-value = 0.488). DATA CONCLUSION: The use of radiomics and kallikreins failed to outperform PI-RADSv2.1/IMPROD bpMRI Likert and their combination did not lead to further performance gains. LEVEL OF EVIDENCE: 1 TECHNICAL EFFICACY: Stage 2.


Asunto(s)
Próstata , Neoplasias de la Próstata , Humanos , Imagen por Resonancia Magnética , Masculino , Pelvis , Próstata/diagnóstico por imagen , Neoplasias de la Próstata/diagnóstico por imagen , Estudios Retrospectivos
5.
J Magn Reson Imaging ; 51(5): 1556-1567, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-31750988

RESUMEN

BACKGROUND: Multiparametric MRI of the prostate has been shown to improve the risk stratification of men with an elevated prostate-specific antigen (PSA). However, long acquisition time, high cost, and inter-center/reader variability of a routine prostate multiparametric MRI limit its wider adoption. PURPOSE: To develop and validate nomograms based on unique rapid biparametric MRI (bpMRI) qualitative and quantitative derived variables for prediction of clinically significant cancer (SPCa). STUDY TYPE: Retrospective analyses of single (IMPROD, NCT01864135) and multiinstitution trials (MULTI-IMPROD, NCT02241122). POPULATION: 161 and 338 prospectively enrolled men who completed the IMPROD and MULTI-IMPROD trials, respectively. FIELD STRENGTH/SEQUENCE: IMPROD bpMRI: 3T/1.5T, T2 -weighted imaging, three separate diffusion-weighted imaging (DWI) acquisitions: 1) b-values 0, 100, 200, 300, 500 s/mm2 ; 2) b values 0, 1500 s/mm2 ; 3) values 0, 2000 s/mm2 . ASSESSMENT: The primary endpoint of the combined trial analysis was the diagnostic accuracy of the combination of IMPROD bpMRI and clinical variables for detection of SPCa. STATISTICAL TESTS: Logistic regression models were developed using IMPROD trial data and validated using MULTI-IMPROD trial data. The model's performance was expressed as the area under the curve (AUC) values for the detection of SPCa, defined as ISUP Gleason Grade Group ≥2. RESULTS: A model incorporating clinical variables had an AUC (95% confidence interval) of 0.83 (0.77-0.89) and 0.80 (0.75-0.85) in the development and validation cohorts, respectively. The corresponding values for a model using IMPROD bpMRI findings were 0.93 (0.89-0.97), and 0.88 (0.84-0.92), respectively. Further addition of the quantitative DWI-based score did not improve AUC values (P < 0.05). DATA CONCLUSION: A prediction model using qualitative IMPROD bpMRI findings demonstrated high accuracy for predicting SPCa in men with an elevated PSA. Online risk calculator: http://petiv.utu.fi/multiimprod/ Level of Evidence: 1 Technical Efficacy Stage: 2 J. Magn. Reson. Imaging 2020;51:1556-1567.


Asunto(s)
Nomogramas , Neoplasias de la Próstata , Biopsia , Humanos , Imagen por Resonancia Magnética , Masculino , Antígeno Prostático Específico , Neoplasias de la Próstata/diagnóstico por imagen , Estudios Retrospectivos
6.
J Magn Reson Imaging ; 51(4): 1075-1085, 2020 04.
Artículo en Inglés | MEDLINE | ID: mdl-31566845

RESUMEN

BACKGROUND: Biochemical recurrence (BCR) affects a significant proportion of patients who undergo robotic-assisted laparoscopic prostatectomy (RALP). PURPOSE: To evaluate the performance of a routine clinical prostate multiparametric magnetic resonance imaging (mpMRI) and Decipher genomic classifier score for prediction of biochemical recurrence in patients who underwent RALP. STUDY TYPE: Retrospective cohort study. SUBJECTS: Ninety-one patients who underwent RALP performed by a single surgeon, had mpMRI before RALP, Decipher taken from RALP samples, and prostate specific antigen (PSA) follow-up for >3 years or BCR within 3 years, defined as PSA >0.2 mg/ml. FIELD STRENGTH/SEQUENCE: mpMRI was performed at 27 different institutions using 1.5T (n = 10) or 3T scanners and included T2 w, diffusion-weighted imaging (DWI), or dynamic contrast-enhanced (DCE) MRI. ASSESSMENT: All mpMRI studies were reported by one reader using Prostate Imaging Reporting and Data System v. 2.1 (PI-RADsv2.1) without knowledge of other findings. Eighteen (20%) randomly selected cases were re-reported by reader B to evaluate interreader variability. STATISTICAL TESTS: Univariate and multivariate analysis using greedy feature selection and tournament leave-pair-out cross-validation (TLPOCV) were used to evaluate the performance of various variables for prediction of BCR, which included clinical (three), systematic biopsy (three), surgical (six: RALP Gleason Grade Group [GGG], extracapsular extension, seminal vesicle invasion, intraoperative surgical margins [PSM], final PSM, pTNM), Decipher (two: Decipher score, Decipher risk category), and mpMRI (eight: prostate volume, PSA density, PI-RADv2.1 score, MRI largest lesion size, summed MRI lesions' volume and relative volume [MRI-lesion-percentage], mpMRI ECE, mpMRI seminal vesicle invasion [SVI]) variables. The evaluation metric was the area under the curve (AUC). RESULTS: Forty-eight (53%) patients developed BCR. The best-performing individual features with TLPOCV AUC of 0.73 (95% confidence interval [CI] 0.64-0.82) were RALP GGG, MRI-lesion-percentage followed by biopsy GGG (0.72, 0.62-0.82), and Decipher score (0.71, 0.60-0.82). The best performance was achieved by feature selection of Decipher+Surgery and MRI + Surgery variables with TLPOCV AUC of 0.82 and 0.81, respectively DATA CONCLUSION: Relative lesion volume measured on a routine clinical mpMRI failed to outperform Decipher score in BCR prediction. LEVEL OF EVIDENCE: 3 Technical Efficacy: Stage 2 J. Magn. Reson. Imaging 2020;51:1075-1085.


Asunto(s)
Imágenes de Resonancia Magnética Multiparamétrica , Neoplasias de la Próstata , Genómica , Humanos , Imagen por Resonancia Magnética , Masculino , Prostatectomía , Neoplasias de la Próstata/diagnóstico por imagen , Neoplasias de la Próstata/cirugía , Estudios Retrospectivos
7.
J Magn Reson Imaging ; 51(5): 1540-1553, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-31588660

RESUMEN

BACKGROUND: Accurate risk stratification of men with a clinical suspicion of prostate cancer (cSPCa) remains challenging despite the increasing use of MRI. PURPOSE: To evaluate the diagnostic accuracy of a unique biparametric MRI protocol (IMPROD bpMRI) combined with clinical and molecular markers in men with cSPCa. STUDY TYPE: Prospective single-institutional clinical trial (NCT01864135). SUBJECTS: Eighty men with cSPCa. FIELD STRENGTH/SEQUENCE: 3T, surface array coils. Two T2 -weighted and three diffusion-weighted imaging (DWI) acquisitions: 1) b-values 0, 100, 200, 300, 500 s/mm2 ; 2) b-values 0,1500 s/mm2 ; 3) b-values 0, 2000 s/mm2 . ASSESSMENT: IMPROD bpMRI examinations were qualitatively (IMPROD bpMRI Likert score) and quantitatively (DWI-based Gleason grade score) prospectively reported. Men with IMPROD bpMRI Likert 3-5 had two targeted biopsies followed by 12-core systematic biopsies (SB); those with IMPROD bpMRI Likert 1-2 had only SB. Additionally, 2-core from normal-appearing prostate areas were obtained for the mRNA expression of ACSM1, AMACR, CACNA1D, DLX1, PCA3, PLA2G7, RHOU, SPINK1, SPON2, TMPRSS2-ERG, and TDRD1 measured by quantitative reverse-transcription polymerase chain reaction. STATISTICAL TESTS: Univariate and multivariate analysis using regularized least-squares, feature selection and tournament leave-pair-out cross-validation (TLPOCV), as well as 10 random splits of the data in training-testing sets, were used to evaluate the mRNA, clinical and IMPROD bpMRI parameters in detecting clinically significant prostate cancer (SPCa) defined as Gleason score ≥ 3 + 4. The evaluation metric was the area under the curve (AUC). RESULTS: IMPROD bpMRI Likert demonstrated the highest TLPOCV AUC of 0.92. The tested clinical variables had AUC 0.56-0.73, while the mRNA and additional IMPROD bpMRI parameters had AUC 0.50-0.67 and 0.65-0.89 respectively. The combination of clinical and mRNA biomarkers produced TLPOCV AUC of 0.87, the highest TLPOCV performance without including IMPROD bpMRI Likert. DATA CONCLUSION: The qualitative IMPROD bpMRI Likert score demonstrated the highest accuracy for SPCa detection compared with the tested clinical variables and mRNA biomarkers. LEVEL OF EVIDENCE: 1 Technical Efficacy Stage: 2 J. Magn. Reson. Imaging 2020;51:1540-1553.


Asunto(s)
Neoplasias de la Próstata , Humanos , Aprendizaje Automático , Imagen por Resonancia Magnética , Masculino , Estudios Prospectivos , Neoplasias de la Próstata/diagnóstico por imagen , Neoplasias de la Próstata/genética , Medición de Riesgo , Inhibidor de Tripsina Pancreática de Kazal
8.
J Acoust Soc Am ; 148(5): 3107, 2020 11.
Artículo en Inglés | MEDLINE | ID: mdl-33261368

RESUMEN

Objectively determined single-number-quantities (SNQs) describing the airborne sound insulation of a façade should correspond to the subjective perception of annoyance to road traffic sounds transmitted through a façade. The reference spectra for spectrum adaptation terms C and Ctr in standard ISO 717-7 (International Organization for Standardization, 2013) are not based on psycho-acoustic evidence. The aim of this study is to develop reference spectra which result in SNQs that explain the subjective annoyance of road traffic sounds transmitted through a façade well. Data from a psycho-acoustic experiment by Hongisto, Oliva, and Rekola [J. Acoust. Soc. Am. 144(2), 1100-1112 (2018)] were used. The data included annoyance ratings for road traffic sounds (five different spectrum alternatives) attenuated by the façade (twelve different sound insulation spectrum alternatives), rated by 43 participants. The reference spectrum for each road traffic spectrum was found using mathematical optimization. The performance of the acquired SNQs was estimated with nested cross-validation. The SNQs determined with the optimized reference spectra performed better than the existing SNQs for two road traffic spectra out of five and for an aggregate of the five road traffic sound types. The results can be exploited in the development of standardized SNQs.


Asunto(s)
Ruido , Sonido , Acústica , Humanos , Matemática
9.
Bioinformatics ; 34(13): i509-i518, 2018 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-29949975

RESUMEN

Motivation: Many inference problems in bioinformatics, including drug bioactivity prediction, can be formulated as pairwise learning problems, in which one is interested in making predictions for pairs of objects, e.g. drugs and their targets. Kernel-based approaches have emerged as powerful tools for solving problems of that kind, and especially multiple kernel learning (MKL) offers promising benefits as it enables integrating various types of complex biomedical information sources in the form of kernels, along with learning their importance for the prediction task. However, the immense size of pairwise kernel spaces remains a major bottleneck, making the existing MKL algorithms computationally infeasible even for small number of input pairs. Results: We introduce pairwiseMKL, the first method for time- and memory-efficient learning with multiple pairwise kernels. pairwiseMKL first determines the mixture weights of the input pairwise kernels, and then learns the pairwise prediction function. Both steps are performed efficiently without explicit computation of the massive pairwise matrices, therefore making the method applicable to solving large pairwise learning problems. We demonstrate the performance of pairwiseMKL in two related tasks of quantitative drug bioactivity prediction using up to 167 995 bioactivity measurements and 3120 pairwise kernels: (i) prediction of anticancer efficacy of drug compounds across a large panel of cancer cell lines; and (ii) prediction of target profiles of anticancer compounds across their kinome-wide target spaces. We show that pairwiseMKL provides accurate predictions using sparse solutions in terms of selected kernels, and therefore it automatically identifies also data sources relevant for the prediction problem. Availability and implementation: Code is available at https://github.com/aalto-ics-kepaco. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Antineoplásicos/farmacología , Biología Computacional/métodos , Descubrimiento de Drogas/métodos , Neoplasias/tratamiento farmacológico , Máquina de Vectores de Soporte , Antineoplásicos/uso terapéutico , Línea Celular Tumoral , Humanos , Neoplasias/enzimología , Neoplasias/metabolismo , Proteínas Quinasas/efectos de los fármacos , Proteínas Quinasas/metabolismo , Transducción de Señal , Programas Informáticos , Resultado del Tratamiento
10.
J Magn Reson Imaging ; 50(5): 1641-1650, 2019 11.
Artículo en Inglés | MEDLINE | ID: mdl-30903647

RESUMEN

BACKGROUND: Prostate MRI is increasingly being used in men with a clinical suspicion of prostate cancer (PCa). However, development and validation of methods for focal therapy planning are still lagging. PURPOSE: To evaluate the diagnostic accuracy on lesion, region-of-interest (ROI), and voxel level of IMPROD biparametric prostate MRI (bpMRI) for PCa detection in men with a clinical suspicion of PCa who subsequently underwent radical prostatectomy. STUDY TYPE: Prospective single-institution clinical trial (NCT01864135). POPULATION: Sixty-four men who underwent radical prostatectomy after IMPROD bpMRI performed in prebiopsy settings. FIELD STRENGTH/SEQUENCE: IMPROD bpMRI consisted of T2 -weighted imaging (T2 w) and three separate diffusion-weighted imaging acquisitions with an average acquisition time of 15 minutes. ASSESSMENT: The diagnostic accuracy of prospectively reported manual cancer delineations and regions increased with 3D dilation were evaluated on the voxel level (volume of 1.17 mm3 , 1 mm3 , 125 mm3 ) as well as the 36 ROI level. Only PCa lesions with a diameter ≥ 5 mm or any Gleason Grade 4 were analyzed. All data and protocols are freely available at: http://petiv.utu.fi/improd STATISTICAL TESTS: Sensitivity, specificity, accuracy. RESULTS: In total, 99 PCa lesions were identified. Forty (40%, 40/99) had a Gleason score (GS) of >3 + 4. Twenty-eight PCa lesions (28%, 28/99) were missed by IMPROD bpMRI, three (7.5%, 3/40) with GS >3 + 4. 3D dilation of manual cancer delineations in all directions by ~10-12 mm (corresponding to the Hausdorff distance) was needed to achieve sensitivity approaching 100% on a voxel level. DATA CONCLUSION: IMPROD bpMRI had a high sensitivity on lesion level for PCa with GS >3 + 4. Increasing 3D lesion delineations by ~10-12 mm (corresponding to the Hausdorff distance) was needed to achieve high sensitivity on the voxel level. Such information may help in planning ablation therapies. LEVEL OF EVIDENCE: 1 Technical Efficacy: Stage 2 J. Magn. Reson. Imaging 2019;50:1641-1650.


Asunto(s)
Imagen por Resonancia Magnética/métodos , Prostatectomía , Neoplasias de la Próstata/diagnóstico por imagen , Anciano , Biopsia , Detección Precoz del Cáncer , Humanos , Masculino , Persona de Mediana Edad , Clasificación del Tumor , Estudios Prospectivos , Próstata/diagnóstico por imagen , Antígeno Prostático Específico/análisis , Proyectos de Investigación , Factores de Tiempo
11.
Anal Chem ; 90(7): 4832-4839, 2018 04 03.
Artículo en Inglés | MEDLINE | ID: mdl-29513001

RESUMEN

Raman spectroscopy is widely used for quantitative pharmaceutical analysis, but a common obstacle to its use is sample fluorescence masking the Raman signal. Time-gating provides an instrument-based method for rejecting fluorescence through temporal resolution of the spectral signal and allows Raman spectra of fluorescent materials to be obtained. An additional practical advantage is that analysis is possible in ambient lighting. This study assesses the efficacy of time-gated Raman spectroscopy for the quantitative measurement of fluorescent pharmaceuticals. Time-gated Raman spectroscopy with a 128 × (2) × 4 CMOS SPAD detector was applied for quantitative analysis of ternary mixtures of solid-state forms of the model drug, piroxicam (PRX). Partial least-squares (PLS) regression allowed quantification, with Raman-active time domain selection (based on visual inspection) improving performance. Model performance was further improved by using kernel-based regularized least-squares (RLS) regression with greedy feature selection in which the data use in both the Raman shift and time dimensions was statistically optimized. Overall, time-gated Raman spectroscopy, especially with optimized data analysis in both the spectral and time dimensions, shows potential for sensitive and relatively routine quantitative analysis of photoluminescent pharmaceuticals during drug development and manufacturing.


Asunto(s)
Colorantes Fluorescentes/análisis , Preparaciones Farmacéuticas/análisis , Análisis de los Mínimos Cuadrados , Espectrometría Raman , Factores de Tiempo
12.
Neural Comput ; 30(8): 2245-2283, 2018 08.
Artículo en Inglés | MEDLINE | ID: mdl-29894652

RESUMEN

Many machine learning problems can be formulated as predicting labels for a pair of objects. Problems of that kind are often referred to as pairwise learning, dyadic prediction, or network inference problems. During the past decade, kernel methods have played a dominant role in pairwise learning. They still obtain a state-of-the-art predictive performance, but a theoretical analysis of their behavior has been underexplored in the machine learning literature. In this work we review and unify kernel-based algorithms that are commonly used in different pairwise learning settings, ranging from matrix filtering to zero-shot learning. To this end, we focus on closed-form efficient instantiations of Kronecker kernel ridge regression. We show that independent task kernel ridge regression, two-step kernel ridge regression, and a linear matrix filter arise naturally as a special case of Kronecker kernel ridge regression, implying that all these methods implicitly minimize a squared loss. In addition, we analyze universality, consistency, and spectral filtering properties. Our theoretical results provide valuable insights into assessing the advantages and limitations of existing pairwise learning methods.

13.
PLoS Comput Biol ; 13(8): e1005678, 2017 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-28787438

RESUMEN

Due to relatively high costs and labor required for experimental profiling of the full target space of chemical compounds, various machine learning models have been proposed as cost-effective means to advance this process in terms of predicting the most potent compound-target interactions for subsequent verification. However, most of the model predictions lack direct experimental validation in the laboratory, making their practical benefits for drug discovery or repurposing applications largely unknown. Here, we therefore introduce and carefully test a systematic computational-experimental framework for the prediction and pre-clinical verification of drug-target interactions using a well-established kernel-based regression algorithm as the prediction model. To evaluate its performance, we first predicted unmeasured binding affinities in a large-scale kinase inhibitor profiling study, and then experimentally tested 100 compound-kinase pairs. The relatively high correlation of 0.77 (p < 0.0001) between the predicted and measured bioactivities supports the potential of the model for filling the experimental gaps in existing compound-target interaction maps. Further, we subjected the model to a more challenging task of predicting target interactions for such a new candidate drug compound that lacks prior binding profile information. As a specific case study, we used tivozanib, an investigational VEGF receptor inhibitor with currently unknown off-target profile. Among 7 kinases with high predicted affinity, we experimentally validated 4 new off-targets of tivozanib, namely the Src-family kinases FRK and FYN A, the non-receptor tyrosine kinase ABL1, and the serine/threonine kinase SLK. Our sub-sequent experimental validation protocol effectively avoids any possible information leakage between the training and validation data, and therefore enables rigorous model validation for practical applications. These results demonstrate that the kernel-based modeling approach offers practical benefits for probing novel insights into the mode of action of investigational compounds, and for the identification of new target selectivities for drug repurposing applications.


Asunto(s)
Biología Computacional/métodos , Descubrimiento de Drogas/métodos , Modelos Estadísticos , Inhibidores de Proteínas Quinasas , Algoritmos , Bases de Datos Factuales , Humanos , Unión Proteica , Inhibidores de Proteínas Quinasas/química , Inhibidores de Proteínas Quinasas/metabolismo , Inhibidores de Proteínas Quinasas/farmacología , Reproducibilidad de los Resultados
14.
Lancet Oncol ; 18(1): 132-142, 2017 01.
Artículo en Inglés | MEDLINE | ID: mdl-27864015

RESUMEN

BACKGROUND: Improvements to prognostic models in metastatic castration-resistant prostate cancer have the potential to augment clinical trial design and guide treatment strategies. In partnership with Project Data Sphere, a not-for-profit initiative allowing data from cancer clinical trials to be shared broadly with researchers, we designed an open-data, crowdsourced, DREAM (Dialogue for Reverse Engineering Assessments and Methods) challenge to not only identify a better prognostic model for prediction of survival in patients with metastatic castration-resistant prostate cancer but also engage a community of international data scientists to study this disease. METHODS: Data from the comparator arms of four phase 3 clinical trials in first-line metastatic castration-resistant prostate cancer were obtained from Project Data Sphere, comprising 476 patients treated with docetaxel and prednisone from the ASCENT2 trial, 526 patients treated with docetaxel, prednisone, and placebo in the MAINSAIL trial, 598 patients treated with docetaxel, prednisone or prednisolone, and placebo in the VENICE trial, and 470 patients treated with docetaxel and placebo in the ENTHUSE 33 trial. Datasets consisting of more than 150 clinical variables were curated centrally, including demographics, laboratory values, medical history, lesion sites, and previous treatments. Data from ASCENT2, MAINSAIL, and VENICE were released publicly to be used as training data to predict the outcome of interest-namely, overall survival. Clinical data were also released for ENTHUSE 33, but data for outcome variables (overall survival and event status) were hidden from the challenge participants so that ENTHUSE 33 could be used for independent validation. Methods were evaluated using the integrated time-dependent area under the curve (iAUC). The reference model, based on eight clinical variables and a penalised Cox proportional-hazards model, was used to compare method performance. Further validation was done using data from a fifth trial-ENTHUSE M1-in which 266 patients with metastatic castration-resistant prostate cancer were treated with placebo alone. FINDINGS: 50 independent methods were developed to predict overall survival and were evaluated through the DREAM challenge. The top performer was based on an ensemble of penalised Cox regression models (ePCR), which uniquely identified predictive interaction effects with immune biomarkers and markers of hepatic and renal function. Overall, ePCR outperformed all other methods (iAUC 0·791; Bayes factor >5) and surpassed the reference model (iAUC 0·743; Bayes factor >20). Both the ePCR model and reference models stratified patients in the ENTHUSE 33 trial into high-risk and low-risk groups with significantly different overall survival (ePCR: hazard ratio 3·32, 95% CI 2·39-4·62, p<0·0001; reference model: 2·56, 1·85-3·53, p<0·0001). The new model was validated further on the ENTHUSE M1 cohort with similarly high performance (iAUC 0·768). Meta-analysis across all methods confirmed previously identified predictive clinical variables and revealed aspartate aminotransferase as an important, albeit previously under-reported, prognostic biomarker. INTERPRETATION: Novel prognostic factors were delineated, and the assessment of 50 methods developed by independent international teams establishes a benchmark for development of methods in the future. The results of this effort show that data-sharing, when combined with a crowdsourced challenge, is a robust and powerful framework to develop new prognostic models in advanced prostate cancer. FUNDING: Sanofi US Services, Project Data Sphere.


Asunto(s)
Protocolos de Quimioterapia Combinada Antineoplásica/uso terapéutico , Modelos Estadísticos , Nomogramas , Neoplasias de la Próstata Resistentes a la Castración/mortalidad , Adolescente , Adulto , Anciano , Teorema de Bayes , Colaboración de las Masas , Docetaxel , Estudios de Seguimiento , Humanos , Masculino , Persona de Mediana Edad , Estadificación de Neoplasias , Prednisona/administración & dosificación , Pronóstico , Neoplasias de la Próstata Resistentes a la Castración/tratamiento farmacológico , Neoplasias de la Próstata Resistentes a la Castración/secundario , Tasa de Supervivencia , Taxoides/administración & dosificación , Adulto Joven
15.
Anal Chem ; 89(5): 3208-3216, 2017 03 07.
Artículo en Inglés | MEDLINE | ID: mdl-28194955

RESUMEN

Methods for simple and fast detection and differentiation of bacterial species are required, for instance, in medicine, water quality monitoring, and the food industry. Here, we have developed a novel label array method for the counting and differentiation of bacterial species. This method is based on the nonspecific interactions of multiple unstable lanthanide chelates and selected chemicals within the sample leading to a luminescence signal profile that is unique to the bacterial species. It is simple, cost-effective, and/or user-friendly compared to many existing methods, such as plate counts on selective media, automatic (hemocytometer-based) cell counters, flow cytometry, and polymerase chain reaction (PCR)-based methods for identification. The performance of the method was demonstrated with nine single strains of bacteria in pure culture. The limit of detection for counting was below 1000 bacteria per mL, with an average coefficient of variation of 10% achieved with the developed label array. A predictive model was trained with the measured luminescence signals and its ability to differentiate all tested bacterial species from each other, including members of the same genus Bacillus licheniformis and Bacillus subtilis, was confirmed via leave-one-out cross-validation. The suitability of the method for analysis of mixtures of bacterial species was shown with ternary mixtures of Bacillus licheniformis, Escherichia coli JM109, and Lactobacillus reuteri ATCC PTA 4659. The potential future application of the method could be monitoring for contamination in pure cultures; analysis of mixed bacterial cultures, where examining one species in the presence of another could inform industrial microbial processes; and the analysis of bacterial biofilms, where nonspecific methods based on physical and chemical characteristics are required instead of methods specific to individual bacterial species.


Asunto(s)
Bacterias/aislamiento & purificación , Citometría de Flujo/métodos , Colorantes Fluorescentes/química , Bacillus/química , Bacillus/aislamiento & purificación , Bacillus/metabolismo , Bacterias/química , Bacterias/metabolismo , Complejos de Coordinación/química , Escherichia coli/química , Escherichia coli/aislamiento & purificación , Escherichia coli/metabolismo , Europio/química , Análisis de Componente Principal
16.
Brief Bioinform ; 16(2): 325-37, 2015 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-24723570

RESUMEN

A number of supervised machine learning models have recently been introduced for the prediction of drug-target interactions based on chemical structure and genomic sequence information. Although these models could offer improved means for many network pharmacology applications, such as repositioning of drugs for new therapeutic uses, the prediction models are often being constructed and evaluated under overly simplified settings that do not reflect the real-life problem in practical applications. Using quantitative drug-target bioactivity assays for kinase inhibitors, as well as a popular benchmarking data set of binary drug-target interactions for enzyme, ion channel, nuclear receptor and G protein-coupled receptor targets, we illustrate here the effects of four factors that may lead to dramatic differences in the prediction results: (i) problem formulation (standard binary classification or more realistic regression formulation), (ii) evaluation data set (drug and target families in the application use case), (iii) evaluation procedure (simple or nested cross-validation) and (iv) experimental setting (whether training and test sets share common drugs and targets, only drugs or targets or neither). Each of these factors should be taken into consideration to avoid reporting overoptimistic drug-target interaction prediction results. We also suggest guidelines on how to make the supervised drug-target interaction prediction studies more realistic in terms of such model formulations and evaluation setups that better address the inherent complexity of the prediction task in the practical applications, as well as novel benchmarking data sets that capture the continuous nature of the drug-target interactions for kinase inhibitors.


Asunto(s)
Descubrimiento de Drogas/estadística & datos numéricos , Biología Computacional , Bases de Datos Farmacéuticas/estadística & datos numéricos , Humanos , Modelos Biológicos , Modelos Estadísticos , Relación Estructura-Actividad Cuantitativa , Aprendizaje Automático Supervisado/estadística & datos numéricos
17.
Magn Reson Med ; 77(3): 1249-1264, 2017 03.
Artículo en Inglés | MEDLINE | ID: mdl-26924552

RESUMEN

PURPOSE: To evaluate different fitting methods for intravoxel incoherent motion (IVIM) imaging of prostate cancer in the terms of repeatability and Gleason score prediction. METHODS: Eighty-one patients with histologically confirmed prostate cancer underwent two repeated 3 Tesla diffusion-weighted imaging (DWI) examinations performed using 14 b-values in the range of 0-500 s/mm2 and diffusion time of 19.004 ms. Mean signal intensities of regions-of-interest were fitted using five different fitting methods for IVIM as well as monoexponential, kurtosis, and stretched exponential models. The fitting methods and models were evaluated in the terms of fitting quality [Akaike information criteria (AIC)], repeatability, and Gleason score prediction. Tumors were classified into three groups (3 + 3, 3 + 4, > 3 + 4). Machine learning algorithms were used to evaluate the performance of the combined use of the parameters. Simulation studies were performed to evaluate robustness of the fitting methods against noise. RESULTS: Monoexponential model was preferred over IVIM based on AIC. The "pseudodiffusion" parameters demonstrated low repeatability and clinical value. Median "pseudodiffusion" fraction values were below 8.00%. Combined use of the parameters did not outperform the monoexponential model. CONCLUSION: Monoexponential model demonstrated the highest repeatability and clinical values in the regions-of-interest based analysis of prostate cancer DWI, b-values in the range of 0-500 s/mm2 . Magn Reson Med 77:1249-1264, 2017. © 2016 International Society for Magnetic Resonance in Medicine.


Asunto(s)
Artefactos , Imagen de Difusión por Resonancia Magnética/métodos , Aumento de la Imagen/métodos , Interpretación de Imagen Asistida por Computador/métodos , Neoplasias de la Próstata/diagnóstico por imagen , Neoplasias de la Próstata/patología , Algoritmos , Simulación por Computador , Humanos , Aprendizaje Automático , Masculino , Modelos Biológicos , Modelos Estadísticos , Movimiento (Física) , Análisis Numérico Asistido por Computador , Reproducibilidad de los Resultados , Sensibilidad y Especificidad , Técnica de Sustracción
18.
Scand J Gastroenterol ; 52(12): 1348-1353, 2017 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-28838273

RESUMEN

BACKGROUND: Clinical use of biosimilar infliximab (CT-P13) in inflammatory bowel diseases (IBDs) is based on extrapolation of indication from clinical studies performed in rheumatological diseases. Only few data exist of behaviour of infliximab trough levels (TLs) and anti-drug antibodies (ADAs) during switching. AIM: The objective of this study was to evaluate changes in TLs, ADA formation and disease activity after switching from originator infliximab to biosimilar one. METHODS: All our IBD patients receiving maintenance infliximab therapy were switched to biosimilar infliximab. TLs and ADAs were measured before the last originator infusion and before the third biosimilar infusion. Laboratory values, disease activity indices (partial Mayo score and Harvey-Bradshaw index) and demographic data were collected from patient records. RESULTS: A total of 62 patients were included in the final analysis (32 Crohn's disease, 30 ulcerative colitis (UC) or IBD-unclassified). No significant changes in median TLs before (5.5 mg/l) and after switching (5.5 mg/l, p = .05) occurred in the entire study group or in the Crohn's disease (CD) subgroup (5.75 and 6.5 mg/l, p = .68). However, in the subgroup of ulcerative colitis, the change in median TL was significantly different (from 5.2 to 4.25 mg/l, p = .019). Two patients developed ADAs after switching. No changes in disease activity were detected during switching and no safety concerns occurred. CONCLUSIONS: Switching from originator to biosimilar infliximab resulted in statistically significant differences in infliximab TLs in patients with UC but not in patients with Crohn's disease. The clinical significance for this difference is doubtful and in neither group changes in disease activity occurred.


Asunto(s)
Anticuerpos Monoclonales/uso terapéutico , Colitis Ulcerosa/tratamiento farmacológico , Fármacos Gastrointestinales/uso terapéutico , Infliximab/uso terapéutico , Adulto , Biosimilares Farmacéuticos/uso terapéutico , Proteína C-Reactiva/análisis , Enfermedad de Crohn/tratamiento farmacológico , Sustitución de Medicamentos , Femenino , Humanos , Masculino , Persona de Mediana Edad , Inducción de Remisión , Resultado del Tratamiento
19.
Anal Chem ; 88(10): 5271-80, 2016 05 17.
Artículo en Inglés | MEDLINE | ID: mdl-27086705

RESUMEN

Quantification and identification of metal ions has gained interest in drinking water and environmental analyses. We have developed a novel label array method for the quantification and identification of metal ions in drinking water. This simple ready-to-go method is based on the nonspecific interactions of multiple unstable lanthanide chelates and nonantenna ligands with sample leading to a luminescence signal profile, unique to the sample components. The limit of detection at ppb concentration level and average coefficient of variation of 10% were achieved with the developed label array. The identification of 15 different metal ions including different oxidation states Cr(3+)/Cr(6+), Cu(+)/Cu(2+), Fe(2+)/Fe(3+), and Pb(2+)/Pb(4+) was demonstrated. Moreover, a binary mixture of Cu(2+) and Fe(3+) and ternary mixture of Cd(2+), Ni(2+), and Pb(2+) were measured and individual ions were distinguished.

20.
Magn Reson Med ; 74(4): 1116-24, 2015 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-25329932

RESUMEN

PURPOSE: To evaluate four mathematical models for diffusion weighted imaging (DWI) of prostate cancer (PCa) in terms of PCa detection and characterization. METHODS: Fifty patients with histologically confirmed PCa underwent two repeated 3 Tesla DWI examinations using 12 equally distributed b values, the highest b value of 2000 s/mm(2) . Normalized mean signal intensities of regions-of-interest were fitted using monoexponential, kurtosis, stretched exponential, and biexponential models. Tumors were classified into low, intermediate, and high Gleason score groups. Areas under receiver operating characteristic curve (AUCs) were estimated to evaluate performance in PCa detection and Gleason score classifications. The fitted parameters were correlated with Gleason score groups by using the Spearman correlation coefficient (ρ). Coefficient of repeatability and intraclass correlation coefficient [specifically ICC(3,1)], were calculated to evaluate repeatability of the fitted parameters. RESULTS: The AUC and ρ values were similar between parameters of monoexponential, kurtosis, and stretched exponential (with the exception of the α parameter) models. The absolute ρ values for ADCm , ADCk , K, and ADCs were in the range from 0.31 to 0.53 (P < 0.01). Parameters of the biexponential model demonstrated low repeatability. CONCLUSION: In region-of-interest based analysis, the monoexponential model for DWI of PCa using b values up to 2000 s/mm(2) was sufficient for PCa detection and characterization.


Asunto(s)
Imagen de Difusión por Resonancia Magnética/métodos , Interpretación de Imagen Asistida por Computador/métodos , Neoplasias de la Próstata/diagnóstico , Neoplasias de la Próstata/patología , Anciano , Área Bajo la Curva , Humanos , Masculino , Persona de Mediana Edad , Clasificación del Tumor , Reproducibilidad de los Resultados
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA