A transfer learning approach via procrustes analysis and mean shift for cancer drug sensitivity prediction.
Turki, Turki; Wei, Zhi; Wang, Jason T L.
Affiliation
  • Turki T; * Department of Computer Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia.
  • Wei Z; † Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA.
  • Wang JTL; † Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA.
J Bioinform Comput Biol; 16(3): 1840014, 2018 Jun.
Article in English | MEDLINE | ID: mdl-29945499
ABSTRACT
Transfer learning (TL) algorithms aim to improve prediction performance in a target task (e.g. the prediction of cisplatin sensitivity in triple-negative breast cancer patients) by transferring knowledge from auxiliary data of a related task (e.g. the prediction of docetaxel sensitivity in breast cancer patients), where the distribution and even the feature space of the data pertaining to the two tasks can differ. In real-world applications, the training set for a target task is sometimes limited, while auxiliary data from a related task are available. Supervised learning, however, requires a sufficiently large training set in the target task to perform well in predicting future test examples of that task. In this paper, we propose a TL approach for cancer drug sensitivity prediction that combines three techniques. First, we shift the representation of a subset of examples from the auxiliary data of a related task to a representation closer to the training set of the target task. Second, we align the shifted representation of the selected auxiliary examples to the target training set, obtaining examples whose representation is aligned with the target training set. Third, we train machine learning algorithms using both the target training set and the aligned examples. We evaluate our approach against baseline approaches using the area under the receiver operating characteristic (ROC) curve (AUC) on real clinical trial datasets pertaining to multiple myeloma, non-small cell lung cancer, triple-negative breast cancer, and breast cancer. Experimental results show that our approach outperforms the baseline approaches, and that the improvement is statistically significant.
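The three steps named in the abstract (shift, align, train on the combined data) and the AUC evaluation can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the details, so every concrete choice here is an assumption. In particular, the sketch assumes that both tasks share the same number of features, that the auxiliary subset is chosen as the examples closest to the target mean, that rows are paired by position for the Procrustes step, and that a nearest-centroid classifier stands in for the unspecified "machine learning algorithms". The data are synthetic and all function names are hypothetical.

```python
import numpy as np

def mean_shift_toward(aux, target, bandwidth=1.0, n_iter=3):
    """Move auxiliary points toward dense regions of the target set using
    Gaussian-kernel mean shift steps computed over the target points."""
    shifted = aux.astype(float).copy()
    for _ in range(n_iter):
        # Squared distance from every shifted point to every target point.
        d2 = ((shifted[:, None, :] - target[None, :, :]) ** 2).sum(axis=2)
        # Subtracting the row minimum avoids underflow; it cancels on normalising.
        w = np.exp(-(d2 - d2.min(axis=1, keepdims=True)) / (2 * bandwidth ** 2))
        shifted = (w @ target) / w.sum(axis=1, keepdims=True)
    return shifted

def procrustes_align(source, target):
    """Orthogonal Procrustes: centre both sets, find the orthogonal matrix R
    minimising ||source_c @ R - target_c||_F, and map source into the target frame.
    Assumes equal shapes and rows paired by position."""
    mu_s, mu_t = source.mean(axis=0), target.mean(axis=0)
    u, _, vt = np.linalg.svd((source - mu_s).T @ (target - mu_t))
    return (source - mu_s) @ (u @ vt) + mu_t

def fit_centroids(X, y):
    """Stand-in classifier (assumption): one centroid per class label."""
    return {int(label): X[y == label].mean(axis=0) for label in np.unique(y)}

def decision_scores(centroids, X):
    """Higher score means closer to the class-1 centroid than to class 0."""
    d0 = np.linalg.norm(X - centroids[0], axis=1)
    d1 = np.linalg.norm(X - centroids[1], axis=1)
    return d0 - d1

def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly,
    counting ties as one half."""
    diff = scores[y_true == 1][:, None] - scores[y_true == 0][None, :]
    return float(((diff > 0) + 0.5 * (diff == 0)).mean())

# Synthetic data (assumption): small target set, larger auxiliary set with offset.
rng = np.random.default_rng(0)
n_target, n_aux, n_features = 20, 60, 5
y_target = np.array([0, 1] * (n_target // 2))
X_target = rng.normal(size=(n_target, n_features)) + y_target[:, None]
y_aux = np.array([0, 1] * (n_aux // 2))
X_aux = rng.normal(loc=3.0, size=(n_aux, n_features)) + y_aux[:, None]

# Subset selection (assumption): auxiliary examples nearest the target mean.
keep = np.argsort(np.linalg.norm(X_aux - X_target.mean(axis=0), axis=1))[:n_target]
X_sub, y_sub = X_aux[keep], y_aux[keep]

shifted = mean_shift_toward(X_sub, X_target, bandwidth=2.0)   # step 1: shift
aligned = procrustes_align(shifted, X_target)                 # step 2: align
X_train = np.vstack([X_target, aligned])                      # step 3: train on both
y_train = np.concatenate([y_target, y_sub])
centroids = fit_centroids(X_train, y_train)

# Evaluate on fresh synthetic examples drawn like the target set.
y_test = np.array([0, 1] * 25)
X_test = rng.normal(size=(50, n_features)) + y_test[:, None]
print("AUC on synthetic test set:", auc(y_test, decision_scores(centroids, X_test)))
```

Pairing rows by position in the Procrustes step is the weakest assumption in this sketch; a real pipeline would need a principled correspondence between auxiliary and target examples, and a way to handle differing feature spaces, which the abstract says the approach allows.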

Full text: 1 Database: MEDLINE Main subject: Algorithms / Computational Biology / Antineoplastic Agents Type of study: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Language: English Journal: J Bioinform Comput Biol Journal subject: BIOLOGY / MEDICAL INFORMATICS Year: 2018 Document type: Article
