Your browser doesn't support javascript.
loading
Identifying Biomarkers Using Support Vector Machine to Understand the Racial Disparity in Triple-Negative Breast Cancer.
Sahoo, Bikram; Pinnix, Zandra; Sims, Seth; Zelikovsky, Alex.
Afiliação
  • Sahoo B; Department of Computer Science, Georgia State University, Atlanta, Georgia, USA.
  • Pinnix Z; Department of Biology and Marine Biology, University of North Carolina at Wilmington, Wilmington, North Carolina, USA.
  • Sims S; Department of Computer Science, Georgia State University, Atlanta, Georgia, USA.
  • Zelikovsky A; Department of Computer Science, Georgia State University, Atlanta, Georgia, USA.
J Comput Biol ; 30(4): 502-517, 2023 04.
Article em En | MEDLINE | ID: mdl-36716280
ABSTRACT
With the properties of aggressive cancer and heterogeneous tumor biology, triple-negative breast cancer (TNBC) is a type of breast cancer known for its poor clinical outcome. The lack of estrogen, progesterone, and human epidermal growth factor receptor in the tumors of TNBC leads to fewer treatment options in clinics. The incidence of TNBC is higher in African American (AA) women compared with European American (EA) women with worse clinical outcomes. The significant factors responsible for the racial disparity in TNBC are socioeconomic lifestyle and tumor biology. The current study considered the open-source gene expression data of triple-negative breast cancer samples' racial information. We implemented a state-of-the-art classification Support Vector Machine (SVM) method with a recurrent feature elimination approach to the gene expression data to identify significant biomarkers deregulated in AA women and EA women. We also included Spearman's rho and Ward's linkage method in our feature selection workflow. Our proposed method generates 24 features/genes that can classify the AA and EA samples 98% accurately. We also performed the Kaplan-Meier analysis and log-rank test on the 24 features/genes. We only discussed the correlation between deregulated expression and cancer progression with a poor survival rate of 2 genes, KLK10 and LRRC37A2, out of 24 genes. We believe that further improvement of our method with a higher number of RNA-seq gene expression data will more accurately provide insight into racial disparity in TNBC.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Disparidades nos Níveis de Saúde / Neoplasias de Mama Triplo Negativas Tipo de estudo: Prognostic_studies Limite: Female / Humans Idioma: En Revista: J Comput Biol Assunto da revista: BIOLOGIA MOLECULAR / INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Disparidades nos Níveis de Saúde / Neoplasias de Mama Triplo Negativas Tipo de estudo: Prognostic_studies Limite: Female / Humans Idioma: En Revista: J Comput Biol Assunto da revista: BIOLOGIA MOLECULAR / INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos