Your browser doesn't support javascript.
loading
Machine-Learning Assisted Screening of Correlated Covariates: Application to Clinical Data of Desipramine.
Asiimwe, Innocent Gerald; S'fiso Ndzamba, Bonginkosi; Mouksassi, Samer; Pillai, Goonaseelan Colin; Lombard, Aurelie; Lang, Jennifer.
Afiliação
  • Asiimwe IG; The Wolfson Centre for Personalized Medicine, Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, UK. i.asiimwe@liverpool.ac.uk.
  • S'fiso Ndzamba B; APT-Africa Fellowship Program, c/o Pharmacometrics Africa NPC, K45 Old Main Building, Groote Schuur Hospital, Cape Town, South Africa. i.asiimwe@liverpool.ac.uk.
  • Mouksassi S; APT-Africa Fellowship Program, c/o Pharmacometrics Africa NPC, K45 Old Main Building, Groote Schuur Hospital, Cape Town, South Africa.
  • Pillai GC; Faculty of health sciences, Department of Pharmacy, Nelson Mandela University, Port Elizabeth, South Africa.
  • Lombard A; Certara, Cairo, Egypt.
  • Lang J; APT-Africa Fellowship Program, c/o Pharmacometrics Africa NPC, K45 Old Main Building, Groote Schuur Hospital, Cape Town, South Africa.
AAPS J ; 26(4): 63, 2024 05 30.
Article em En | MEDLINE | ID: mdl-38816519
ABSTRACT
Stepwise covariate modeling (SCM) has a high computational burden and can select the wrong covariates. Machine learning (ML) has been proposed as a screening tool to improve the efficiency of covariate selection, but little is known about how to apply ML on actual clinical data. First, we simulated datasets based on clinical data to compare the performance of various ML and traditional pharmacometrics (PMX) techniques with and without accounting for highly-correlated covariates. This simulation step identified the ML algorithm and the number of top covariates to select when using the actual clinical data. A previously developed desipramine population-pharmacokinetic model was used to simulate virtual subjects. Fifteen covariates were considered with four having an effect included. Based on the F1 score (an accuracy measure), ridge regression was the most accurate ML technique on 200 simulated datasets (F1 score = 0.475 ± 0.231), a performance which almost doubled when highly-correlated covariates were accounted for (F1 score = 0.860 ± 0.158). These performances were better than forwards selection with SCM (F1 score = 0.251 ± 0.274 and 0.499 ± 0.381 without/with correlations respectively). In terms of computational cost, ridge regression (0.42 ± 0.07 seconds/simulated dataset, 1 thread) was ~20,000 times faster than SCM (2.30 ± 2.29 hours, 15 threads). On the clinical dataset, prescreening with the selected ML algorithm reduced SCM runtime by 42.86% (from 1.75 to 1.00 days) and produced the same final model as SCM only. In conclusion, we have demonstrated that accounting for highly-correlated covariates improves ML prescreening accuracy. The choice of ML method and the proportion of important covariates (unknown a priori) can be guided by simulations.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Desipramina / Aprendizado de Máquina Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Desipramina / Aprendizado de Máquina Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article