RESUMO
OBJECTIVES: To validate an artificial intelligence (AI)-based fully automatic coronary artery calcium (CAC) scoring system on non-electrocardiogram (ECG)-gated low-dose chest computed tomography (LDCT) using multi-institutional datasets with manual CAC scoring as the reference standard. METHODS: This retrospective study included 452 subjects from three academic institutions, who underwent both ECG-gated calcium scoring computed tomography (CSCT) and LDCT scans. For all CSCT and LDCT scans, automatic CAC scoring (CAC_auto) was performed using AI-based software, and manual CAC scoring (CAC_man) was set as the reference standard. The reliability and agreement of CAC_auto was evaluated and compared with that of CAC_man using intraclass correlation coefficients (ICCs) and Bland-Altman plots. The reliability between CAC_auto and CAC_man for CAC severity categories was analyzed using weighted kappa (κ) statistics. RESULTS: CAC_auto on CSCT and LDCT yielded a high ICC (0.998, 95% confidence interval (CI) 0.998-0.999 and 0.989, 95% CI 0.987-0.991, respectively) and a mean difference with 95% limits of agreement of 1.3 ± 37.1 and 0.8 ± 75.7, respectively. CAC_auto achieved excellent reliability for CAC severity (κ = 0.918-0.972) on CSCT and good to excellent but heterogenous reliability among datasets (κ = 0.748-0.924) on LDCT. CONCLUSIONS: The application of an AI-based automatic CAC scoring software to LDCT shows good to excellent reliability in CAC score and CAC severity categorization in multi-institutional datasets; however, the reliability varies among institutions. KEY POINTS: ⢠AI-based automatic CAC scoring on LDCT shows excellent reliability with manual CAC scoring in multi-institutional datasets. ⢠The reliability for CAC score-based severity categorization varies among datasets. ⢠Automatic scoring for LDCT shows a higher false-positive rate than automatic scoring for CSCT, and most common causes of a false-positive are image noise and artifacts for both CSCT and LDCT.
Assuntos
Cálcio , Técnicas de Imagem de Sincronização Cardíaca , Vasos Coronários , Tomografia Computadorizada por Raios X , Humanos , Inteligência Artificial , Cálcio/análise , Técnicas de Imagem de Sincronização Cardíaca/métodos , Vasos Coronários/diagnóstico por imagem , Conjuntos de Dados como Assunto , Eletrocardiografia , Estudos Multicêntricos como Assunto , Reprodutibilidade dos Testes , Estudos Retrospectivos , Tomografia Computadorizada por Raios X/métodosRESUMO
OBJECTIVES: Cardiovascular border (CB) analysis is the primary method for detecting and quantifying the severity of cardiovascular disease using posterior-anterior chest radiographs (CXRs). This study aimed to develop and validate a deep learning-based automatic CXR CB analysis algorithm (CB_auto) for diagnosing and quantitatively evaluating valvular heart disease (VHD). METHODS: We developed CB_auto using 816 normal and 798 VHD CXRs. For validation, 640 normal and 542 VHD CXRs from three different hospitals and 132 CXRs from a public dataset were assigned. The reliability of the CB parameters determined by CB_auto was evaluated. To evaluate the differences between parameters determined by CB_auto and manual CB drawing (CB_hand), the absolute percentage measurement error (APE) was calculated. Pearson correlation coefficients were calculated between CB_hand and echocardiographic measurements. RESULTS: CB parameters determined by CB_auto yielded excellent reliability (intraclass correlation coefficient > 0.98). The 95% limits of agreement for the cardiothoracic ratio were 0.00 ± 0.04% without systemic bias. The differences between parameters determined by CB_auto and CB_hand as defined by the APE were < 10% for all parameters except for carinal angle and left atrial appendage. In the public dataset, all CB parameters were successfully drawn in 124 of 132 CXRs (93.9%). All CB parameters were significantly greater in VHD than in normal controls (all p < 0.05). All CB parameters showed significant correlations (p < 0.05) with echocardiographic measurements. CONCLUSIONS: The CB_auto system empowered by deep learning algorithm provided highly reliable CB measurements that could be useful not only in daily clinical practice but also for research purposes. KEY POINTS: ⢠A deep learning-based automatic CB analysis algorithm for diagnosing and quantitatively evaluating VHD using posterior-anterior chest radiographs was developed and validated. ⢠Our algorithm (CB_auto) yielded comparable reliability to manual CB drawing (CB_hand) in terms of various CB measurement variables, as confirmed by external validation with datasets from three different hospitals and a public dataset. ⢠All CB parameters were significantly different between VHD and normal control measurements, and echocardiographic measurements were significantly correlated with CB parameters measured from normal control and VHD CXRs.