RESUMO
PURPOSE: We determined whether a large, multianalyte panel of circulating biomarkers can improve detection of early-stage pancreatic ductal adenocarcinoma (PDAC). MATERIALS AND METHODS: We defined a biologically relevant subspace of blood analytes on the basis of previous identification in premalignant lesions or early-stage PDAC and evaluated each in pilot studies. The 31 analytes that met minimum diagnostic accuracy were measured in serum of 837 subjects (461 healthy, 194 benign pancreatic disease, and 182 early-stage PDAC). We used machine learning to develop classification algorithms using the relationship between subjects on the basis of their changes across the predictors. Model performance was subsequently evaluated in an independent validation data set from 186 additional subjects. RESULTS: A classification model was trained on 669 subjects (358 healthy, 159 benign, and 152 early-stage PDAC). Model evaluation on a hold-out test set of 168 subjects (103 healthy, 35 benign, and 30 early-stage PDAC) yielded an area under the receiver operating characteristic curve (AUC) of 0.920 for classification of PDAC from non-PDAC (benign and healthy controls) and an AUC of 0.944 for PDAC versus healthy controls. The algorithm was then validated in 146 subsequent cases presenting with pancreatic disease (73 benign pancreatic disease and 73 early- and late-stage PDAC cases) and 40 healthy control subjects. The validation set yielded an AUC of 0.919 for classification of PDAC from non-PDAC and an AUC of 0.925 for PDAC versus healthy controls. CONCLUSION: Individually weak serum biomarkers can be combined into a strong classification algorithm to develop a blood test to identify patients who may benefit from further testing.