Your browser doesn't support javascript.
loading
Artificial image objects for classification of breast cancer biomarkers with transcriptome sequencing data and convolutional neural network algorithms.
Chen, Xiangning; Chen, Daniel G; Zhao, Zhongming; Balko, Justin M; Chen, Jingchun.
Afiliación
  • Chen X; 410 AI, LLC, Germantown, MD, 20876, USA. va.samchen@gmail.com.
  • Chen DG; 410 AI, LLC, Germantown, MD, 20876, USA.
  • Zhao Z; Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
  • Balko JM; Department of Psychiatry and Behavioral Sciences, McGovern Medical School, The University of Texas, Houston, TX, 77030, USA.
  • Chen J; Department of Medicine, Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN, USA.
Breast Cancer Res ; 23(1): 96, 2021 10 10.
Article en En | MEDLINE | ID: mdl-34629099
ABSTRACT

BACKGROUND:

Transcriptome sequencing has been broadly available in clinical studies. However, it remains a challenge to utilize these data effectively for clinical applications due to the high dimension of the data and the highly correlated expression between individual genes.

METHODS:

We proposed a method to transform RNA sequencing data into artificial image objects (AIOs) and applied convolutional neural network (CNN) algorithms to classify these AIOs. With the AIO technique, we considered each gene as a pixel in an image and its expression level as pixel intensity. Using the GSE96058 (n = 2976), GSE81538 (n = 405), and GSE163882 (n = 222) datasets, we created AIOs for the subjects and designed CNN models to classify biomarker Ki67 and Nottingham histologic grade (NHG).

RESULTS:

With fivefold cross-validation, we accomplished a classification accuracy and AUC of 0.821 ± 0.023 and 0.891 ± 0.021 for Ki67 status. For NHG, the weighted average of categorical accuracy was 0.820 ± 0.012, and the weighted average of AUC was 0.931 ± 0.006. With GSE96058 as training data and GSE81538 as testing data, the accuracy and AUC for Ki67 were 0.826 ± 0.037 and 0.883 ± 0.016, and that for NHG were 0.764 ± 0.052 and 0.882 ± 0.012, respectively. These results were 10% better than the results reported in the original studies. For Ki67, the calls generated from our models had a better power for prediction of survival as compared to the calls from trained pathologists in survival analyses.

CONCLUSIONS:

We demonstrated that RNA sequencing data could be transformed into AIOs and be used to classify Ki67 status and NHG with CNN algorithms. The AIO method could handle high-dimensional data with highly correlated variables, and there was no need for variable selection. With the AIO technique, a data-driven, consistent, and automation-ready model could be developed to classify biomarkers with RNA sequencing data and provide more efficient care for cancer patients.
Asunto(s)
Palabras clave

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Algoritmos / Neoplasias de la Mama / Redes Neurales de la Computación Tipo de estudio: Prognostic_studies Límite: Female / Humans Idioma: En Revista: Breast Cancer Res Asunto de la revista: NEOPLASIAS Año: 2021 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Algoritmos / Neoplasias de la Mama / Redes Neurales de la Computación Tipo de estudio: Prognostic_studies Límite: Female / Humans Idioma: En Revista: Breast Cancer Res Asunto de la revista: NEOPLASIAS Año: 2021 Tipo del documento: Article País de afiliación: Estados Unidos