CancerEMC: frontline non-invasive cancer screening from circulating protein biomarkers and mutations in cell-free DNA.
Bioinformatics
; 37(19): 3319-3327, 2021 Oct 11.
Article
em En
| MEDLINE
| ID: mdl-33515231
MOTIVATION: The early detection of cancer through accessible blood tests can foster early patient interventions. Although there are developments in cancer detection from cell-free DNA (cfDNA), its accuracy remains speculative. Given its central importance with broad impacts, we aspire to address the challenge. METHOD: A bagging Ensemble Meta Classifier (CancerEMC) is proposed for early cancer detection based on circulating protein biomarkers and mutations in cfDNA from blood. CancerEMC is generally designed for both binary cancer detection and multi-class cancer type localization. It can address the class imbalance problem in multi-analyte blood test data based on robust oversampling and adaptive synthesis techniques. RESULTS: Based on the clinical blood test data, we observe that the proposed CancerEMC has outperformed other algorithms and state-of-the-arts studies (including CancerSEEK) for cancer detection. The results reveal that our proposed method (i.e. CancerEMC) can achieve the best performance result for both binary cancer classification with 99.17% accuracy (AUC = 0.999) and localized multiple cancer detection with 74.12% accuracy (AUC = 0.938). Addressing the data imbalance issue with oversampling techniques, the accuracy can be increased to 91.50% (AUC = 0.992), where the state-of-the-art method can only be estimated at 69.64% (AUC = 0.921). Similar results can also be observed on independent and isolated testing data. AVAILABILITY: https://github.com/saifurcubd/Cancer-Detection. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Texto completo:
1
Base de dados:
MEDLINE
Idioma:
En
Ano de publicação:
2021
Tipo de documento:
Article