Your browser doesn't support javascript.
loading
Prediction of Breast Cancer using Machine Learning Approaches.
Rabiei, Reza; Ayyoubzadeh, Seyed Mohammad; Sohrabei, Solmaz; Esmaeili, Marzieh; Atashi, Alireza.
Afiliação
  • Rabiei R; PhD, Department of Health Information Technology and Management, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
  • Ayyoubzadeh SM; PhD, Department of Health Information Technology and Management, School of Allied Medical Sciences, Tehran University of Medical Science, Tehran, Iran.
  • Sohrabei S; MSc, Department Deputy of Development, Management and Resources, Office of Statistic and Information Technology Management, Zanjan University of Medical Sciences, Zanjan, Iran.
  • Esmaeili M; PhD, Department of Health Information Technology and Management, School of Allied Medical Sciences, Tehran University of Medical Science, Tehran, Iran.
  • Atashi A; PhD, Department of E-Health, Virtual School, Tehran University of Medical Sciences, Medical Informatics Research Group, Clinical Research Department, Breast Cancer Research Center, Motamed Cancer Institute, ACECR, Tehran, Iran.
J Biomed Phys Eng ; 12(3): 297-308, 2022 Jun.
Article em En | MEDLINE | ID: mdl-35698545
ABSTRACT

Background:

Breast cancer is considered one of the most common cancers in women caused by various clinical, lifestyle, social, and economic factors. Machine learning has the potential to predict breast cancer based on features hidden in data.

Objective:

This study aimed to predict breast cancer using different machine-learning approaches applying demographic, laboratory, and mammographic data. Material and

Methods:

In this analytical study, the database, including 5,178 independent records, 25% of which belonged to breast cancer patients with 24 attributes in each record was obtained from Motamed cancer institute (ACECR), Tehran, Iran. The database contained 5,178 independent records, 25% of which belonged to breast cancer patients containing 24 attributes in each record. The random forest (RF), neural network (MLP), gradient boosting trees (GBT), and genetic algorithms (GA) were used in this study. Models were initially trained with demographic and laboratory features (20 features). The models were then trained with all demographic, laboratory, and mammographic features (24 features) to measure the effectiveness of mammography features in predicting breast cancer.

Results:

RF presented higher performance compared to other techniques (accuracy 80%, sensitivity 95%, specificity 80%, and the area under the curve (AUC) 0.56). Gradient boosting (AUC=0.59) showed a stronger performance compared to the neural network.

Conclusion:

Combining multiple risk factors in modeling for breast cancer prediction could help the early diagnosis of the disease with necessary care plans. Collection, storage, and management of different data and intelligent systems based on multiple factors for predicting breast cancer are effective in disease management.
Palavras-chave

Texto completo: 1 Bases de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies / Screening_studies Idioma: En Revista: J Biomed Phys Eng Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Irã

Texto completo: 1 Bases de dados: MEDLINE Tipo de estudo: Prognostic_studies / Risk_factors_studies / Screening_studies Idioma: En Revista: J Biomed Phys Eng Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Irã