RESUMEN
Background Noninvasive tests can be used to screen patients with chronic liver disease for advanced liver fibrosis; however, the use of single tests may not be adequate. Purpose To construct sequential clinical algorithms that include a US deep learning (DL) model and compare their ability to predict advanced liver fibrosis with that of other noninvasive tests. Materials and Methods This retrospective study included adult patients with a history of chronic liver disease or unexplained abnormal liver function test results who underwent B-mode US of the liver between January 2014 and September 2022 at three health care facilities. A US-based DL network (FIB-Net) was trained on US images to predict whether the shear-wave elastography (SWE) value was 8.7 kPa or higher, indicative of advanced fibrosis. In the internal and external test sets, a two-step algorithm (Two-step#1) using the Fibrosis-4 Index (FIB-4) followed by FIB-Net and a three-step algorithm (Three-step#1) using FIB-4 followed by FIB-Net and SWE were used to simulate screening scenarios where liver stiffness measurements were not or were available, respectively. Measures of diagnostic accuracy were calculated using liver biopsy as the reference standard and compared between FIB-4, SWE, FIB-Net, and European Association for the Study of the Liver guidelines (ie, FIB-4 followed by SWE), along with sequential algorithms. Results The training, validation, and test data sets included 3067 (median age, 42 years [IQR, 33-53 years]; 2083 male), 1599 (median age, 41 years [IQR, 33-51 years]; 1124 male), and 1228 (median age, 44 years [IQR, 33-55 years]; 741 male) patients, respectively. FIB-Net obtained a noninferior specificity with a margin of 5% (P < .001) compared with SWE (80% vs 82%). The Two-step#1 algorithm showed higher specificity and positive predictive value (PPV) than FIB-4 (specificity, 79% vs 57%; PPV, 44% vs 32%) while reducing unnecessary referrals by 42%. The Three-step#1 algorithm had higher specificity and PPV compared with European Association for the Study of the Liver guidelines (specificity, 94% vs 88%; PPV, 73% vs 64%) while reducing unnecessary referrals by 35%. Conclusion A sequential algorithm combining FIB-4 and a US DL model showed higher diagnostic accuracy and improved referral management for all-cause advanced liver fibrosis compared with FIB-4 or the DL model alone. © RSNA, 2024 Supplemental material is available for this article. See also the editorial by Ghosh in this issue.
Asunto(s)
Algoritmos , Diagnóstico por Imagen de Elasticidad , Cirrosis Hepática , Humanos , Masculino , Cirrosis Hepática/diagnóstico por imagen , Persona de Mediana Edad , Femenino , Estudios Retrospectivos , Diagnóstico por Imagen de Elasticidad/métodos , Adulto , Aprendizaje Profundo , Hígado/diagnóstico por imagen , Hígado/patología , Anciano , Ultrasonografía/métodosRESUMEN
Background: Tongue inspection, an essential diagnostic method in Traditional Chinese Medicine (TCM), has the potential for early-stage disease screening. This study aimed to evaluate the effectiveness of deep learning-based analysis of tongue images for hepatic fibrosis screening. Methods: A total of 1083 tongue images were collected from 741 patients and divided into training, validation, and test sets. DenseNet-201, a convolutional neural network, was employed to train the AI model using these tongue images. The predictive performance of AI was assessed and compared with that of FIB-4, using real-time two-dimensional shear wave elastography as the reference standard. Results: The proposed AI model achieved an accuracy of 0.845 (95% CI: 0.79-0.90) and 0.814 (95% CI: 0.76-0.87) in the validation and test sets, respectively, with negative predictive values (NPVs) exceeding 90% in both sets. The AI model outperformed FIB-4 in all aspects, and when combined with FIB-4, the NPV reached 94.4%. Conclusion: Tongue inspection, with the assistance of AI, could serve as a first-line screening method for hepatic fibrosis.