Your browser doesn't support javascript.
loading
Unlocking the complete blood count as a risk stratification tool for breast cancer using machine learning: a large scale retrospective study.
Araujo, Daniella Castro; Rocha, Bruno Aragão; Gomes, Karina Braga; da Silva, Daniel Noce; Ribeiro, Vinicius Moura; Kohara, Marco Aurelio; Tostes Marana, Fernanda; Bitar, Renata Andrade; Veloso, Adriano Alonso; Pintao, Maria Carolina; da Silva, Flavia Helena; Viana, Celso Ferraz; de Souza, Pedro Henrique Araújo; da Silva, Ismael Dale Cotrim Guerreiro.
Afiliação
  • Araujo DC; Huna, São Paulo, Brazil. danicastroaraujo@gmail.com.
  • Rocha BA; Departamento de Ciências da Computação, Instituto de Ciências Exatas, Universidade Federal de Minas Gerais/UFMG, Campus Belo Horizonte, Minas Gerais, Brazil. danicastroaraujo@gmail.com.
  • Gomes KB; Grupo Fleury, São Paulo, Brazil.
  • da Silva DN; Departamento de Análises Clínicas e Toxicológicas, Faculdade de Farmácia, Universidade Federal de Minas Gerais/UFMG, Campus Belo Horizonte, Minas Gerais, Brazil.
  • Ribeiro VM; Huna, São Paulo, Brazil.
  • Kohara MA; Huna, São Paulo, Brazil.
  • Tostes Marana F; Huna, São Paulo, Brazil.
  • Bitar RA; Huna, São Paulo, Brazil.
  • Veloso AA; Huna, São Paulo, Brazil.
  • Pintao MC; Departamento de Ciências da Computação, Instituto de Ciências Exatas, Universidade Federal de Minas Gerais/UFMG, Campus Belo Horizonte, Minas Gerais, Brazil.
  • da Silva FH; Grupo Fleury, São Paulo, Brazil.
  • Viana CF; Grupo Fleury, São Paulo, Brazil.
  • de Souza PHA; Grupo Fleury, São Paulo, Brazil.
  • da Silva IDCG; Huna, São Paulo, Brazil.
Sci Rep ; 14(1): 10841, 2024 05 12.
Article em En | MEDLINE | ID: mdl-38736010
ABSTRACT
Optimizing early breast cancer (BC) detection requires effective risk assessment tools. This retrospective study from Brazil showcases the efficacy of machine learning in discerning complex patterns within routine blood tests, presenting a globally accessible and cost-effective approach for risk evaluation. We analyzed complete blood count (CBC) tests from 396,848 women aged 40-70, who underwent breast imaging or biopsies within six months after their CBC test. Of these, 2861 (0.72%) were identified as cases 1882 with BC confirmed by anatomopathological tests, and 979 with highly suspicious imaging (BI-RADS 5). The remaining 393,987 participants (99.28%), with BI-RADS 1 or 2 results, were classified as controls. The database was divided into modeling (including training and validation) and testing sets based on diagnostic certainty. The testing set comprised cases confirmed by anatomopathology and controls cancer-free for 4.5-6.5 years post-CBC. Our ridge regression model, incorporating neutrophil-lymphocyte ratio, red blood cells, and age, achieved an AUC of 0.64 (95% CI 0.64-0.65). We also demonstrate that these results are slightly better than those from a boosting machine learning model, LightGBM, plus having the benefit of being fully interpretable. Using the probabilistic output from this model, we divided the study population into four risk groups high, moderate, average, and low risk, which obtained relative ratios of BC of 1.99, 1.32, 1.02, and 0.42, respectively. The aim of this stratification was to streamline prioritization, potentially improving the early detection of breast cancer, particularly in resource-limited environments. As a risk stratification tool, this model offers the potential for personalized breast cancer screening by prioritizing women based on their individual risk, thereby indicating a shift from a broad population strategy.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias da Mama / Aprendizado de Máquina Limite: Adult / Aged / Female / Humans / Middle aged País como assunto: America do sul / Brasil Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias da Mama / Aprendizado de Máquina Limite: Adult / Aged / Female / Humans / Middle aged País como assunto: America do sul / Brasil Idioma: En Ano de publicação: 2024 Tipo de documento: Article