Your browser doesn't support javascript.
loading
Exploring Prognostic Gene Factors in Breast Cancer via Machine Learning.
Ma, QingLan; Chen, Lei; Feng, KaiYan; Guo, Wei; Huang, Tao; Cai, Yu-Dong.
Afiliación
  • Ma Q; School of Life Sciences, Shanghai University, Shanghai, 200444, China.
  • Chen L; College of Information Engineering, Shanghai Maritime University, Shanghai, 201306, China.
  • Feng K; Department of Computer Science, Guangdong AIB Polytechnic College, Guangzhou, 510507, China.
  • Guo W; Key Laboratory of Stem Cell Biology, Shanghai Jiao Tong University School of Medicine (SJTUSM) & Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai, 200030, China.
  • Huang T; Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China. tohuangtao@126.com.
  • Cai YD; CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China. tohuangtao@126.com.
Biochem Genet ; 2024 Feb 21.
Article en En | MEDLINE | ID: mdl-38383836
ABSTRACT
Breast cancer remains the most prevalent cancer in women. To date, its underlying molecular mechanisms have not been fully uncovered. The determination of gene factors is important to improve our understanding on breast cancer, which can correlate the specific gene expression and tumor staging. However, the knowledge in this regard is still far from complete. Thus, this study aimed to explore these knowledge gaps by analyzing existing gene expression profile data from 3149 breast cancer samples, where each sample was represented by the expression of 19,644 genes and classified into Nottingham histological grade (NHG) classes (Grade 1, 2, and 3). To this end, a machine learning-based framework was designed. First, the profile data were analyzed by using seven feature ranking algorithms to evaluate the importance of features (genes). Seven feature lists were generated, each of which sorted features in accordance with feature importance evaluated from a special aspect. Then, the incremental feature selection method was applied to each list to determine essential features for classification and building efficient classifiers. Consequently, overlapping genes, such as AURKA, CBX2, and MYBL2, were deemed as potentially related to breast cancer malignancy and prognosis, indicating that such genes were identified to be important by multiple feature ranking algorithms. In addition, the study formulated classification rules to reflect special gene expression patterns for three NHG classes. Some genes and rules were analyzed and supported by recent literature, providing new references for studying breast cancer.
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Idioma: En Revista: Biochem Genet Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Banco de datos: MEDLINE Idioma: En Revista: Biochem Genet Año: 2024 Tipo del documento: Article País de afiliación: China