Evaluation of a machine-learning model based on laboratory parameters for the prediction of acute leukaemia subtypes: a multicentre model development and validation study in France.

Alcazer, Vincent; Le Meur, Grégoire; Roccon, Marie; Barriere, Sabrina; Le Calvez, Baptiste; Badaoui, Bouchra; Spaeth, Agathe; Kosmider, Olivier; Freynet, Nicolas; Eveillard, Marion; Croizier, Carolyne; Chevalier, Simon; Sujobert, Pierre

Alcazer, Vincent; Le Meur, Grégoire; Roccon, Marie; Barriere, Sabrina; Le Calvez, Baptiste; Badaoui, Bouchra; Spaeth, Agathe; Kosmider, Olivier; Freynet, Nicolas; Eveillard, Marion; Croizier, Carolyne; Chevalier, Simon; Sujobert, Pierre.

Afiliação

Alcazer V; Department of Clinical Hematology, Hospices Civils de Lyon, Hôpital Lyon Sud, Lyon, France; International Center for Infectiology Research, Inserm U1111, Lyon, France. Electronic address: vincent.alcazer@chu-lyon.fr.
Le Meur G; Department of Clinical Hematology, Hospices Civils de Lyon, Hôpital Lyon Sud, Lyon, France.
Roccon M; Laboratory of Hematology, Centre Hospitalier Universitaire Grenoble Alpes, Grenoble, France.
Barriere S; Department of Clinical Hematology, Centre Hospitalier Universitaire de Clermont-Ferrand, Clermont-Ferrand, France.
Le Calvez B; Pediatric Oncology, Centre Hospitalier Universitaire de Nantes, Nantes, France.
Badaoui B; Department of Biological Hematology and Immunology, Assistance Publique-Hôpitaux de Paris, Hôpitaux Universitaires Henri Mondor, Paris, France.
Spaeth A; Laboratory of Hematology, Assistance Publique-Hôpitaux de Paris, Hôpital Cochin, Paris, France.
Kosmider O; Laboratory of Hematology, Assistance Publique-Hôpitaux de Paris, Hôpital Cochin, Paris, France.
Freynet N; Department of Biological Hematology and Immunology, Assistance Publique-Hôpitaux de Paris, Hôpitaux Universitaires Henri Mondor, Paris, France.
Eveillard M; Pediatric Oncology, Centre Hospitalier Universitaire de Nantes, Nantes, France.
Croizier C; Department of Clinical Hematology, Centre Hospitalier Universitaire de Clermont-Ferrand, Clermont-Ferrand, France.
Chevalier S; Laboratory of Hematology, Centre Hospitalier Universitaire Grenoble Alpes, Grenoble, France.
Sujobert P; Laboratory of Hematology, Hospices Civils de Lyon, Hôpital Lyon Sud, Lyon, France; International Center for Infectiology Research, Inserm U1111, Lyon, France.

Lancet Digit Health ; 6(5): e323-e333, 2024 May.

Article em En | MEDLINE | ID: mdl-38670741

ABSTRACT

ABSTRACT

BACKGROUND:

Acute leukaemias are life-threatening haematological cancers characterised by the infiltration of transformed immature haematopoietic cells in the blood and bone marrow. Prompt and accurate diagnosis of the three main acute leukaemia subtypes (ie acute lymphocytic leukaemia [ALL], acute myeloid leukaemia [AML], and acute promyelocytic leukaemia [APL]) is of utmost importance to guide initial treatment and prevent early mortality but requires cytological expertise that is not always available. We aimed to benchmark different machine-learning strategies using a custom variable selection algorithm to propose an extreme gradient boosting model to predict leukaemia subtypes on the basis of routine laboratory parameters.

METHODS:

This multicentre model development and validation study was conducted with data from six independent French university hospital databases. Patients aged 18 years or older diagnosed with AML, APL, or ALL in any one of these six hospital databases between March 1, 2012, and Dec 31, 2021, were recruited. 22 routine parameters were collected at the time of initial disease evaluation; variables with more than 25% of missing values in two datasets were not used for model training, leading to the final inclusion of 19 parameters. The performances of the final model were evaluated on internal testing and external validation sets with area under the receiver operating characteristic curves (AUCs), and clinically relevant cutoffs were chosen to guide clinical decision making. The final tool, Artificial Intelligence Prediction of Acute Leukemia (AI-PAL), was developed from this model.

FINDINGS:

1410 patients diagnosed with AML, APL, or ALL were included. Data quality control showed few missing values for each cohort, with the exception of uric acid and lactate dehydrogenase for the cohort from Hôpital Cochin. 679 patients from Hôpital Lyon Sud and Centre Hospitalier Universitaire de Clermont-Ferrand were split into the training (n=477) and internal testing (n=202) sets. 731 patients from the four other cohorts were used for external validation. Overall AUCs across all validation cohorts were 0·97 (95% CI 0·95-0·99) for APL, 0·90 (0·83-0·97) for ALL, and 0·89 (0·82-0·95) for AML. Cutoffs were then established on the overall cohort of 1410 patients to guide clinical decisions. Confident cutoffs showed two (0·14%) wrong predictions for ALL, four (0·28%) wrong predictions for APL, and three (0·21%) wrong predictions for AML. Use of the overall cutoff greatly reduced the number of missing predictions; diagnosis was proposed for 1375 (97·5%) of 1410 patients for each category, with only a slight increase in wrong predictions. The final model evaluation across both the internal testing and external validation sets showed accuracy of 99·5% for ALL diagnosis, 98·8% for AML diagnosis, and 99·7% for APL diagnosis in the confident model and accuracy of 87·9% for ALL diagnosis, 86·3% for AML diagnosis, and 96·1% for APL diagnosis in the overall model.

INTERPRETATION:

AI-PAL allowed for accurate diagnosis of the three main acute leukaemia subtypes. Based on ten simple laboratory parameters, its broad availability could help guide initial therapies in a context where cytological expertise is lacking, such as in low-income countries.

FUNDING:

None.

Assuntos

Leucemia Mieloide Aguda; Aprendizado de Máquina; Humanos; França; Leucemia Mieloide Aguda/diagnóstico; Feminino; Masculino; Pessoa de Meia-Idade; Adulto; Idoso; Leucemia-Linfoma Linfoblástico de Células Precursoras/diagnóstico; Leucemia Promielocítica Aguda/diagnóstico; Algoritmos

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Limite: Adult / Aged / Female / Humans / Male / Middle aged País/Região como assunto: Europa Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google