Your browser doesn't support javascript.
loading
Development of fecal microbial diagnostic marker sets of colorectal cancer using natural language processing method.
Liu, Houcong; Song, Changpu; Wang, Jidong; Chen, Zhufang; Zhang, Xiaohong; Zhou, Hekai; Yao, Linhong; Chen, Dan; Gu, Wenhao; Huang, Rui-Kun; Huang, Bing-Kun; Han, Bo-Wei; Du, Jihui.
Afiliação
  • Liu H; Research Center for Clinical and Translational Medicine, Huazhong University of Science and Technology Union Shenzhen Hospital, and the 6th Affiliated Hospital of Shenzhen University Medical School, Shenzhen, Guangdong, China.
  • Song C; Guangdong Jiyin Biotech Co. Ltd, Shenzhen, Guangdong, China.
  • Wang J; Research Center for Clinical and Translational Medicine, Huazhong University of Science and Technology Union Shenzhen Hospital, and the 6th Affiliated Hospital of Shenzhen University Medical School, Shenzhen, Guangdong, China.
  • Chen Z; Research Center for Clinical and Translational Medicine, Huazhong University of Science and Technology Union Shenzhen Hospital, and the 6th Affiliated Hospital of Shenzhen University Medical School, Shenzhen, Guangdong, China.
  • Zhang X; Research Center for Clinical and Translational Medicine, Huazhong University of Science and Technology Union Shenzhen Hospital, and the 6th Affiliated Hospital of Shenzhen University Medical School, Shenzhen, Guangdong, China.
  • Zhou H; Research Center for Clinical and Translational Medicine, Huazhong University of Science and Technology Union Shenzhen Hospital, and the 6th Affiliated Hospital of Shenzhen University Medical School, Shenzhen, Guangdong, China.
  • Yao L; Research Center for Clinical and Translational Medicine, Huazhong University of Science and Technology Union Shenzhen Hospital, and the 6th Affiliated Hospital of Shenzhen University Medical School, Shenzhen, Guangdong, China.
  • Chen D; Guangdong Jiyin Biotech Co. Ltd, Shenzhen, Guangdong, China.
  • Gu W; Guangdong Jiyin Biotech Co. Ltd, Shenzhen, Guangdong, China.
  • Huang RK; Guangdong Jiyin Biotech Co. Ltd, Shenzhen, Guangdong, China.
  • Huang BK; Guangdong Jiyin Biotech Co. Ltd, Shenzhen, Guangdong, China.
  • Han BW; Guangdong Jiyin Biotech Co. Ltd, Shenzhen, Guangdong, China.
  • Du J; Research Center for Clinical and Translational Medicine, Huazhong University of Science and Technology Union Shenzhen Hospital, and the 6th Affiliated Hospital of Shenzhen University Medical School, Shenzhen, Guangdong, China.
Int J Biol Markers ; 39(1): 31-39, 2024 Mar.
Article em En | MEDLINE | ID: mdl-38128926
ABSTRACT

BACKGROUND:

Cancer screening and early detection greatly increase the chances of successful treatment. However, most cancer types lack effective early screening biomarkers. In recent years, natural language processing (NLP)-based text-mining methods have proven effective in searching the scientific literature and identifying promising associations between potential biomarkers and disease, but unfortunately few are widely used.

METHODS:

In this study, we used an NLP-enabled text-mining system, MarkerGenie, to identify potential stool bacterial markers for early detection and screening of colorectal cancer. After filtering markers based on text-mining results, we validated bacterial markers using multiplex digital droplet polymerase chain reaction (ddPCR). Classifiers were built based on ddPCR results, and sensitivity, specificity, and area under the curve (AUC) were used to evaluate the performance.

RESULTS:

A total of 7 of the 14 bacterial markers showed significantly increased abundance in the stools of colorectal cancer patients. A five-bacteria classifier for colorectal cancer diagnosis was built, and achieved an AUC of 0.852, with a sensitivity of 0.692 and specificity of 0.935. When combined with the fecal immunochemical test (FIT), our classifier achieved an AUC of 0.959 and increased the sensitivity of FIT (0.929 vs. 0.872) at a specificity of 0.900.

CONCLUSIONS:

Our study provides a valuable case example of the use of NLP-based marker mining for biomarker identification.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Neoplasias Colorretais Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Processamento de Linguagem Natural / Neoplasias Colorretais Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article