RESUMEN
Lung cancer is a leading cause of death worldwide, mostly due to diagnostics in the advanced stage. Therefore, the development of a quick, simple, and non-invasive diagnostic tool to identify cancer is essential. However, the creation of a reliable diagnostic tool is possible only in case of selectivity to other diseases, particularly, cancer of other localizations. This paper is devoted to the study of the variability of exhaled breath samples among patients with lung cancer and cancer of other localizations, such as esophageal, breast, colorectal, kidney, stomach, prostate, cervix, and skin. For this, gas chromatography-mass spectrometry (GC-MS) was used. Two classification models were built. The first model separated patients with lung cancer and cancer of other localizations. The second model classified patients with lung, esophageal, breast, colorectal, and kidney cancer. Mann-Whitney U tests and Kruskal-Wallis H tests were applied to identify differences in investigated groups. Discriminant analysis (DA), gradient-boosted decision trees (GBDT), and artificial neural networks (ANN) were applied to create the models. In the case of classifying lung cancer and cancer of other localizations, average sensitivity and specificity were 68% and 69%, respectively. However, the accuracy of classifying groups of patients with lung, esophageal, breast, colorectal, and kidney cancer was poor.
Asunto(s)
Carcinoma de Células Renales , Neoplasias Colorrectales , Neoplasias Renales , Neoplasias Pulmonares , Femenino , Masculino , Humanos , Neoplasias Pulmonares/diagnóstico , BiomarcadoresRESUMEN
Non-invasive, simple, and fast tests for lung cancer diagnostics are one of the urgent needs for clinical practice. The work describes the results of exhaled breath analysis of 112 lung cancer patients and 120 healthy individuals using gas chromatography-mass spectrometry (GC-MS). Volatile organic compound (VOC) peak areas and their ratios were considered for data analysis. VOC profiles of patients with various histological types, tumor localization, TNM stage, and treatment status were considered. The effect of non-pulmonary comorbidities (chronic heart failure, hypertension, anemia, acute cerebrovascular accident, obesity, diabetes) on exhaled breath composition of lung cancer patients was studied for the first time. Significant correlations between some VOC peak areas and their ratios and these factors were found. Diagnostic models were created using gradient boosted decision trees (GBDT) and artificial neural network (ANN). The performance of developed models was compared. ANN model was the most accurate: 82-88% sensitivity and 80-86% specificity on the test data.