Your browser doesn't support javascript.
loading
Deep-learning-assisted diagnosis for knee magnetic resonance imaging: Development and retrospective validation of MRNet.
Bien, Nicholas; Rajpurkar, Pranav; Ball, Robyn L; Irvin, Jeremy; Park, Allison; Jones, Erik; Bereket, Michael; Patel, Bhavik N; Yeom, Kristen W; Shpanskaya, Katie; Halabi, Safwan; Zucker, Evan; Fanton, Gary; Amanatullah, Derek F; Beaulieu, Christopher F; Riley, Geoffrey M; Stewart, Russell J; Blankenberg, Francis G; Larson, David B; Jones, Ricky H; Langlotz, Curtis P; Ng, Andrew Y; Lungren, Matthew P.
Afiliação
  • Bien N; Department of Computer Science, Stanford University, Stanford, California, United States of America.
  • Rajpurkar P; Department of Computer Science, Stanford University, Stanford, California, United States of America.
  • Ball RL; Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, California, United States of America.
  • Irvin J; Department of Computer Science, Stanford University, Stanford, California, United States of America.
  • Park A; Department of Computer Science, Stanford University, Stanford, California, United States of America.
  • Jones E; Department of Computer Science, Stanford University, Stanford, California, United States of America.
  • Bereket M; Department of Computer Science, Stanford University, Stanford, California, United States of America.
  • Patel BN; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Yeom KW; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Shpanskaya K; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Halabi S; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Zucker E; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Fanton G; Department of Orthopedic Surgery, Stanford University, Stanford, California, United States of America.
  • Amanatullah DF; Department of Orthopedic Surgery, Stanford University, Stanford, California, United States of America.
  • Beaulieu CF; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Riley GM; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Stewart RJ; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Blankenberg FG; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Larson DB; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Jones RH; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Langlotz CP; Department of Radiology, Stanford University, Stanford, California, United States of America.
  • Ng AY; Department of Computer Science, Stanford University, Stanford, California, United States of America.
  • Lungren MP; Department of Radiology, Stanford University, Stanford, California, United States of America.
PLoS Med ; 15(11): e1002699, 2018 11.
Article em En | MEDLINE | ID: mdl-30481176
ABSTRACT

BACKGROUND:

Magnetic resonance imaging (MRI) of the knee is the preferred method for diagnosing knee injuries. However, interpretation of knee MRI is time-intensive and subject to diagnostic error and variability. An automated system for interpreting knee MRI could prioritize high-risk patients and assist clinicians in making diagnoses. Deep learning methods, in being able to automatically learn layers of features, are well suited for modeling the complex relationships between medical images and their interpretations. In this study we developed a deep learning model for detecting general abnormalities and specific diagnoses (anterior cruciate ligament [ACL] tears and meniscal tears) on knee MRI exams. We then measured the effect of providing the model's predictions to clinical experts during interpretation. METHODS AND

FINDINGS:

Our dataset consisted of 1,370 knee MRI exams performed at Stanford University Medical Center between January 1, 2001, and December 31, 2012 (mean age 38.0 years; 569 [41.5%] female patients). The majority vote of 3 musculoskeletal radiologists established reference standard labels on an internal validation set of 120 exams. We developed MRNet, a convolutional neural network for classifying MRI series and combined predictions from 3 series per exam using logistic regression. In detecting abnormalities, ACL tears, and meniscal tears, this model achieved area under the receiver operating characteristic curve (AUC) values of 0.937 (95% CI 0.895, 0.980), 0.965 (95% CI 0.938, 0.993), and 0.847 (95% CI 0.780, 0.914), respectively, on the internal validation set. We also obtained a public dataset of 917 exams with sagittal T1-weighted series and labels for ACL injury from Clinical Hospital Centre Rijeka, Croatia. On the external validation set of 183 exams, the MRNet trained on Stanford sagittal T2-weighted series achieved an AUC of 0.824 (95% CI 0.757, 0.892) in the detection of ACL injuries with no additional training, while an MRNet trained on the rest of the external data achieved an AUC of 0.911 (95% CI 0.864, 0.958). We additionally measured the specificity, sensitivity, and accuracy of 9 clinical experts (7 board-certified general radiologists and 2 orthopedic surgeons) on the internal validation set both with and without model assistance. Using a 2-sided Pearson's chi-squared test with adjustment for multiple comparisons, we found no significant differences between the performance of the model and that of unassisted general radiologists in detecting abnormalities. General radiologists achieved significantly higher sensitivity in detecting ACL tears (p-value = 0.002; q-value = 0.019) and significantly higher specificity in detecting meniscal tears (p-value = 0.003; q-value = 0.019). Using a 1-tailed t test on the change in performance metrics, we found that providing model predictions significantly increased clinical experts' specificity in identifying ACL tears (p-value < 0.001; q-value = 0.006). The primary limitations of our study include lack of surgical ground truth and the small size of the panel of clinical experts.

CONCLUSIONS:

Our deep learning model can rapidly generate accurate clinical pathology classifications of knee MRI exams from both internal and external datasets. Moreover, our results support the assertion that deep learning models can improve the performance of clinical experts during medical imaging interpretation. Further research is needed to validate the model prospectively and to determine its utility in the clinical setting.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Imageamento por Ressonância Magnética / Interpretação de Imagem Assistida por Computador / Diagnóstico por Computador / Lesões do Ligamento Cruzado Anterior / Lesões do Menisco Tibial / Aprendizado Profundo / Joelho Tipo de estudo: Diagnostic_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Adult / Female / Humans / Male / Middle aged Idioma: En Revista: PLoS Med Assunto da revista: MEDICINA Ano de publicação: 2018 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Imageamento por Ressonância Magnética / Interpretação de Imagem Assistida por Computador / Diagnóstico por Computador / Lesões do Ligamento Cruzado Anterior / Lesões do Menisco Tibial / Aprendizado Profundo / Joelho Tipo de estudo: Diagnostic_studies / Observational_studies / Prognostic_studies / Risk_factors_studies Limite: Adult / Female / Humans / Male / Middle aged Idioma: En Revista: PLoS Med Assunto da revista: MEDICINA Ano de publicação: 2018 Tipo de documento: Article País de afiliação: Estados Unidos