RESUMEN
BACKGROUND: Clinical data warehouses provide access to massive amounts of medical images, but these images are often heterogeneous. They can for instance include images acquired both with or without the injection of a gadolinium-based contrast agent. Harmonizing such data sets is thus fundamental to guarantee unbiased results, for example when performing differential diagnosis. Furthermore, classical neuroimaging software tools for feature extraction are typically applied only to images without gadolinium. The objective of this work is to evaluate how image translation can be useful to exploit a highly heterogeneous data set containing both contrast-enhanced and non-contrast-enhanced images from a clinical data warehouse. METHODS: We propose and compare different 3D U-Net and conditional GAN models to convert contrast-enhanced T1-weighted (T1ce) into non-contrast-enhanced (T1nce) brain MRI. These models were trained using 230 image pairs and tested on 77 image pairs from the clinical data warehouse of the Greater Paris area. RESULTS: Validation using standard image similarity measures demonstrated that the similarity between real and synthetic T1nce images was higher than between real T1nce and T1ce images for all the models compared. The best performing models were further validated on a segmentation task. We showed that tissue volumes extracted from synthetic T1nce images were closer to those of real T1nce images than volumes extracted from T1ce images. CONCLUSION: We showed that deep learning models initially developed with research quality data could synthesize T1nce from T1ce images of clinical quality and that reliable features could be extracted from the synthetic images, thus demonstrating the ability of such methods to help exploit a data set coming from a clinical data warehouse.
Asunto(s)
Data Warehousing , Gadolinio , Humanos , Encéfalo/diagnóstico por imagen , Imagen por Resonancia Magnética/métodos , Neuroimagen/métodos , Procesamiento de Imagen Asistido por Computador/métodosRESUMEN
Containing the medical data of millions of patients, clinical data warehouses (CDWs) represent a great opportunity to develop computational tools. Magnetic resonance images (MRIs) are particularly sensitive to patient movements during image acquisition, which will result in artefacts (blurring, ghosting and ringing) in the reconstructed image. As a result, a significant number of MRIs in CDWs are corrupted by these artefacts and may be unusable. Since their manual detection is impossible due to the large number of scans, it is necessary to develop tools to automatically exclude (or at least identify) images with motion in order to fully exploit CDWs. In this paper, we propose a novel transfer learning method from research to clinical data for the automatic detection of motion in 3D T1-weighted brain MRI. The method consists of two steps: a pre-training on research data using synthetic motion, followed by a fine-tuning step to generalise our pre-trained model to clinical data, relying on the labelling of 4045 images. The objectives were both (1) to be able to exclude images with severe motion, (2) to detect mild motion artefacts. Our approach achieved excellent accuracy for the first objective with a balanced accuracy nearly similar to that of the annotators (balanced accuracy>80 %). However, for the second objective, the performance was weaker and substantially lower than that of human raters. Overall, our framework will be useful to take advantage of CDWs in medical imaging and highlight the importance of a clinical validation of models trained on research data.
Asunto(s)
Artefactos , Data Warehousing , Humanos , Movimiento (Física) , Encéfalo/diagnóstico por imagen , Imagen por Resonancia MagnéticaRESUMEN
A variety of algorithms have been proposed for computer-aided diagnosis of dementia from anatomical brain MRI. These approaches achieve high accuracy when applied to research data sets but their performance on real-life clinical routine data has not been evaluated yet. The aim of this work was to study the performance of such approaches on clinical routine data, based on a hospital data warehouse, and to compare the results to those obtained on a research data set. The clinical data set was extracted from the hospital data warehouse of the Greater Paris area, which includes 39 different hospitals. The research set was composed of data from the Alzheimer's Disease Neuroimaging Initiative data set. In the clinical set, the population of interest was identified by exploiting the diagnostic codes from the 10th revision of the International Classification of Diseases that are assigned to each patient. We studied how the imbalance of the training sets, in terms of contrast agent injection and image quality, may bias the results. We demonstrated that computer-aided diagnosis performance was strongly biased upwards (over 17 percent points of balanced accuracy) by the confounders of image quality and contrast agent injection, a phenomenon known as the Clever Hans effect or shortcut learning. When these biases were removed, the performance was very poor. In any case, the performance was considerably lower than on the research data set. Our study highlights that there are still considerable challenges for translating dementia computer-aided diagnosis systems to clinical routine.
Asunto(s)
Enfermedad de Alzheimer , Medios de Contraste , Humanos , Data Warehousing , Encéfalo/diagnóstico por imagen , Imagen por Resonancia Magnética/métodos , Enfermedad de Alzheimer/diagnóstico por imagen , Aprendizaje Automático , ComputadoresRESUMEN
OBJECTIVE: Osteoporosis is underdiagnosed and undertreated, although severe complications of osteoporotic fractures, including vertebral fractures, are well known. This study sought to assess the feasibility and results of an opportunistic screening of vertebral fractures and osteoporosis in a large database of lumbar or abdominal CT scans. MATERIAL AND METHODS: Data were analysed from CT scans obtained in 35 hospitals from patients aged 60 years or older and stored in a Picture Archiving and Communication System in Assistance-Publique-Hôpitaux de Paris, from 2007 to 2013. Dedicated software was used to analyse the presence or absence of at least 1 vertebral fracture (VF), and the radiodensity of the lumbar vertebrae was measured Hounsfield Units (HUs). A simulated T-score was calculated. RESULTS: Data were analysed from 152 268 patients [mean age (S.D.) = 73.2 (9.07) years]. Success rates for VF assessment and HUs measurements were 82 and 87%, respectively. The prevalence of VFs was 24.5% and increased with age. Areas under the receiver operating characteristic curves for the detection of VFs were 0.61 and 0.62 for the mean HUs of the lumbar vertebrae and the L1 HUs, respectively. In patients without VFs, HUs decreased with age, similarly in males and females. The prevalence of osteoporosis (sT-score ≤ -2.5) was 23.8% and 36.5% in patients without and with VFs, respectively. CONCLUSION: It is feasible on a large scale to screen for VFs and osteoporosis during opportunistic screening in patients 60 years or older having lumbar or abdominal CT.
Asunto(s)
Osteoporosis , Fracturas Osteoporóticas , Fracturas de la Columna Vertebral , Absorciometría de Fotón/métodos , Anciano , Densidad Ósea , Femenino , Humanos , Vértebras Lumbares/diagnóstico por imagen , Masculino , Osteoporosis/complicaciones , Osteoporosis/diagnóstico por imagen , Osteoporosis/epidemiología , Fracturas Osteoporóticas/diagnóstico por imagen , Fracturas Osteoporóticas/epidemiología , Fracturas Osteoporóticas/etiología , Fracturas de la Columna Vertebral/diagnóstico por imagen , Fracturas de la Columna Vertebral/epidemiología , Fracturas de la Columna Vertebral/etiología , Tomografía Computarizada por Rayos X/métodosRESUMEN
Many studies on machine learning (ML) for computer-aided diagnosis have so far been mostly restricted to high-quality research data. Clinical data warehouses, gathering routine examinations from hospitals, offer great promises for training and validation of ML models in a realistic setting. However, the use of such clinical data warehouses requires quality control (QC) tools. Visual QC by experts is time-consuming and does not scale to large datasets. In this paper, we propose a convolutional neural network (CNN) for the automatic QC of 3D T1-weighted brain MRI for a large heterogeneous clinical data warehouse. To that purpose, we used the data warehouse of the hospitals of the Greater Paris area (Assistance Publique-Hôpitaux de Paris [AP-HP]). Specifically, the objectives were: 1) to identify images which are not proper T1-weighted brain MRIs; 2) to identify acquisitions for which gadolinium was injected; 3) to rate the overall image quality. We used 5000 images for training and validation and a separate set of 500 images for testing. In order to train/validate the CNN, the data were annotated by two trained raters according to a visual QC protocol that we specifically designed for application in the setting of a data warehouse. For objectives 1 and 2, our approach achieved excellent accuracy (balanced accuracy and F1-score >90%), similar to the human raters. For objective 3, the performance was good but substantially lower than that of human raters. Nevertheless, the automatic approach accurately identified (balanced accuracy and F1-score >80%) low quality images, which would typically need to be excluded. Overall, our approach shall be useful for exploiting hospital data warehouses in medical image computing.