Results 1 - 2 of 2
1.
Eur Radiol ; 32(1): 725-736, 2022 Jan.
Article in English | MEDLINE | ID: mdl-34286375

ABSTRACT

OBJECTIVES: The purpose of this study was to build a deep learning model to derive labels from neuroradiology reports and assign these to the corresponding examinations, overcoming a bottleneck to computer vision model development.
METHODS: Reference-standard labels were generated by a team of neuroradiologists for model training and evaluation. Three thousand examinations were labelled for the presence or absence of any abnormality by manually scrutinising the corresponding radiology reports ('reference-standard report labels'); a subset of these examinations (n = 250) were assigned 'reference-standard image labels' by interrogating the actual images. Separately, 2000 reports were labelled for the presence or absence of 7 specialised categories of abnormality (acute stroke, mass, atrophy, vascular abnormality, small vessel disease, white matter inflammation, encephalomalacia), with a subset of these examinations (n = 700) also assigned reference-standard image labels. A deep learning model was trained using labelled reports and validated in two ways: comparing predicted labels to (i) reference-standard report labels and (ii) reference-standard image labels. The area under the receiver operating characteristic curve (AUC-ROC) was used to quantify model performance. Accuracy, sensitivity, specificity, and F1 score were also calculated.
RESULTS: Accurate classification (AUC-ROC > 0.95) was achieved for all categories when tested against reference-standard report labels. A drop in performance (ΔAUC-ROC > 0.02) was seen for three categories (atrophy, encephalomalacia, vascular) when tested against reference-standard image labels, highlighting discrepancies in the original reports. Once trained, the model assigned labels to 121,556 examinations in under 30 min.
CONCLUSIONS: Our model accurately classifies head MRI examinations, enabling automated dataset labelling for downstream computer vision applications.
KEY POINTS:
• Deep learning is poised to revolutionise image recognition tasks in radiology; however, a barrier to clinical adoption is the difficulty of obtaining large labelled datasets for model training.
• We demonstrate a deep learning model which can derive labels from neuroradiology reports and assign these to the corresponding examinations at scale, facilitating the development of downstream computer vision models.
• We rigorously tested our model by comparing labels predicted on the basis of neuroradiology reports with two sets of reference-standard labels: (1) labels derived by manually scrutinising each radiology report and (2) labels derived by interrogating the actual images.


Subject(s)
Deep Learning, Area Under Curve, Humans, Magnetic Resonance Imaging, Radiography, Radiologists
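The evaluation metrics named in this abstract (AUC-ROC and F1 score) can be sketched in plain Python. This is a minimal illustration of how the metrics are defined; the labels and scores below are made-up toy values, not data from the study:

```python
def auc_roc(y_true, y_score):
    """AUC-ROC via the pairwise definition: the probability that a
    randomly chosen positive case outranks a randomly chosen negative."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for binary labels."""
    tp = sum(y and p for y, p in zip(y_true, y_pred))
    fp = sum((not y) and p for y, p in zip(y_true, y_pred))
    fn = sum(y and (not p) for y, p in zip(y_true, y_pred))
    return 2 * tp / (2 * tp + fp + fn)

# Illustrative report-derived labels vs. model scores (not study data)
y_true = [1, 1, 0, 1, 0, 0]
y_score = [0.92, 0.85, 0.30, 0.40, 0.65, 0.10]
print(auc_roc(y_true, y_score))                            # rank-based AUC
print(f1_score(y_true, [int(s >= 0.5) for s in y_score]))  # F1 at 0.5 cut-off
```

Validating against two reference standards, as the study does, amounts to calling these metrics twice on the same predictions: once against report-derived labels and once against image-derived labels, with the gap between the two AUCs exposing report-image discrepancies.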
2.
PLoS One ; 15(6): e0234722, 2020.
Article in English | MEDLINE | ID: mdl-32530947

ABSTRACT

BACKGROUND AND PURPOSE: Machine learning (ML) has attracted much attention with the hope that it could make use of large, routinely collected datasets and deliver accurate personalised prognosis. The aim of this systematic review is to identify and critically appraise the reporting and development of ML models for predicting outcomes after stroke.
METHODS: We searched PubMed and Web of Science from 1990 to March 2019, using previously published search filters for stroke, ML, and prediction models. We focused on structured clinical data, excluding image and text analysis. This review was registered with PROSPERO (CRD42019127154).
RESULTS: Eighteen studies were eligible for inclusion. Most studies reported fewer than half of the terms in the reporting quality checklist. The most frequently predicted stroke outcomes were mortality (7 studies) and functional outcome (5 studies). The most commonly used ML methods were random forests (9 studies), support vector machines (8 studies), decision trees (6 studies), and neural networks (6 studies). The median sample size was 475 (range 70-3184), with a median of 22 predictors (range 4-152) considered. All studies evaluated discrimination, with thirteen using area under the ROC curve, whilst calibration was assessed in three. Two studies performed external validation. None described the final model sufficiently well to reproduce it.
CONCLUSIONS: The use of ML for predicting stroke outcomes is increasing. However, few studies met basic reporting standards for clinical prediction tools, and none made their models available in a way that could be used or evaluated. Major improvements in ML study conduct and reporting are needed before it can meaningfully be considered for practice.


Subject(s)
Machine Learning, Stroke/diagnosis, Humans, Statistical Models, Prognosis
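The review's distinction between discrimination (assessed by all studies) and calibration (assessed by only three) can be illustrated with a minimal sketch. The Brier score and binned reliability check below are standard calibration measures; the probabilities are hypothetical examples, not data from the reviewed studies:

```python
def brier_score(y_true, y_prob):
    """Mean squared error between predicted probabilities and outcomes:
    lower means better calibration (0 is perfect)."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)

def reliability_bins(y_true, y_prob, n_bins=4):
    """Group predictions into probability bins and compare the mean
    predicted risk with the observed event rate in each bin; a
    well-calibrated model shows the two values close together."""
    bins = [[] for _ in range(n_bins)]
    for y, p in zip(y_true, y_prob):
        bins[min(int(p * n_bins), n_bins - 1)].append((y, p))
    return [
        (round(sum(p for _, p in b) / len(b), 3),   # mean predicted risk
         round(sum(y for y, _ in b) / len(b), 3))   # observed event rate
        for b in bins if b
    ]

# Hypothetical predicted outcome probabilities (not review data)
y_true = [0, 0, 1, 0, 1, 1, 1, 0]
y_prob = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
print(brier_score(y_true, y_prob))
print(reliability_bins(y_true, y_prob))
```

A model can discriminate well (high AUC) while still being poorly calibrated, which is why the review treats the two as separate evaluation criteria.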