Non-invasive classification of non-small cell lung cancer: a comparison between random forest models utilising radiomic and semantic features.

Bashir, Usman; Kawa, Bhavin; Siddique, Muhammad; Mak, Sze Mun; Nair, Arjun; Mclean, Emma; Bille, Andrea; Goh, Vicky; Cook, Gary

Bashir, Usman; Kawa, Bhavin; Siddique, Muhammad; Mak, Sze Mun; Nair, Arjun; Mclean, Emma; Bille, Andrea; Goh, Vicky; Cook, Gary.

Afiliação

Bashir U; 1 Cancer Imaging Department, School of Biomedical Engineering and Imaging Sciences, King's College London , London , UK.
Kawa B; 2 Department of Radiology, Maidstone Hospital, Hermitage Lane , Maidstone , UK.
Siddique M; 1 Cancer Imaging Department, School of Biomedical Engineering and Imaging Sciences, King's College London , London , UK.
Mak SM; 3 Department of Radiology, Guy's Hospital, Great Maze Pond , London , UK.
Nair A; 3 Department of Radiology, Guy's Hospital, Great Maze Pond , London , UK.
Mclean E; 4 Department of Pathology, Guy's Hospital and St Thomas' NHS Foundation Trust, Westminster Bridge Rd, Lambeth , London , UK.
Bille A; 5 Department of Thoracic Surgery, Guy's Hospital, Great Maze Pond , London , UK.
Goh V; 3 Department of Radiology, Guy's Hospital, Great Maze Pond , London , UK.
Cook G; 6 PET Imaging Centre and the Division of Imaging Sciences and Biomedical Engineering, King's College London, , UK.

Br J Radiol ; 92(1099): 20190159, 2019 Jul.

Article em En | MEDLINE | ID: mdl-31166787

RESUMO

OBJECTIVE: Non-invasive distinction between squamous cell carcinoma and adenocarcinoma subtypes of non-small-cell lung cancer (NSCLC) may be beneficial to patients unfit for invasive diagnostic procedures or when tissue is insufficient for diagnosis. The purpose of our study was to compare the performance of random forest algorithms utilizing CT radiomics and/or semantic features in classifying NSCLC. METHODS: Two thoracic radiologists scored 11 semantic features on CT scans of 106 patients with NSCLC. A set of 115 radiomics features was extracted from the CT scans. Random forest models were developed from semantic (RM-sem), radiomics (RM-rad), and all features combined (RM-all). External validation of models was performed using an independent test data set (n = 100) of CT scans. Model performance was measured with out-of-bag error and area under curve (AUC), and compared using receiver-operating characteristics curve analysis on the test data set. RESULTS: The median (interquartile-range) error rates of the models were: RF-sem 24.5 % (22.6 - 37.5 %), RF-rad 35.8 % (34.9 - 38.7 %), and RM-all 37.7 % (37.7 - 37.7). On training data, both RF-rad and RF-all gave perfect discrimination (AUC = 1), which was significantly higher than that achieved by RF-sem (AUC = 0.78; p < 0.0001). On test data, however, RM-sem model (AUC = 0.82) out-performed RM-rad and RM-all (AUC = 0.5 and AUC = 0.56; p < 0.0001), neither of which was significantly different from random guess ( p = 0.9 and 0.6 respectively). CONCLUSION: Non-invasive classification of NSCLC can be done accurately using random forest classification models based on well-known CT-derived descriptive features. However, radiomics-based classification models performed poorly in this scenario when tested on independent data and should be used with caution, due to their possible lack of generalizability to new data. ADVANCES IN KNOWLEDGE: Our study describes novel CT-derived random forest models based on radiologist-interpretation of CT scans (semantic features) that can assist NSCLC classification when histopathology is equivocal or when histopathological sampling is not possible. It also shows that random forest models based on semantic features may be more useful than those built from computational radiomic features.

Assuntos

Adenocarcinoma/diagnóstico por imagem; Carcinoma Pulmonar de Células não Pequenas/diagnóstico por imagem; Carcinoma de Células Escamosas/diagnóstico por imagem; Neoplasias Pulmonares/diagnóstico por imagem; Tomografia Computadorizada por Raios X/métodos; Adulto; Idoso; Idoso de 80 Anos ou mais; Algoritmos; Feminino; Humanos; Pulmão/diagnóstico por imagem; Masculino; Pessoa de Meia-Idade; Radiometria; Reprodutibilidade dos Testes; Semântica

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Carcinoma de Células Escamosas / Adenocarcinoma / Tomografia Computadorizada por Raios X / Carcinoma Pulmonar de Células não Pequenas / Neoplasias Pulmonares Tipo de estudo: Clinical_trials / Prognostic_studies Limite: Adult / Aged / Aged80 / Female / Humans / Male / Middle aged Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google