Your browser doesn't support javascript.
loading
Predicting Promoters in Multiple Prokaryotes with Prompt.
Du, Qimeng; Guo, Yixue; Zhang, Junpeng; Lu, Fuping; Peng, Chong; Zhou, Chichun.
Afiliação
  • Du Q; School of Engineering, Air-Space-Ground Integrated Intelligence and Big Data Application Engineering Research Center of Yunnan Provincial Department of Education, Dali University, Dali, 671003, China.
  • Guo Y; College of Biotechnology, Tianjin University of Science & Technology, Tianjin, 300457, China.
  • Zhang J; School of Engineering, Air-Space-Ground Integrated Intelligence and Big Data Application Engineering Research Center of Yunnan Provincial Department of Education, Dali University, Dali, 671003, China.
  • Lu F; College of Biotechnology, Tianjin University of Science & Technology, Tianjin, 300457, China.
  • Peng C; College of Biotechnology, Tianjin University of Science & Technology, Tianjin, 300457, China. cpeng@tust.edu.cn.
  • Zhou C; School of Engineering, Air-Space-Ground Integrated Intelligence and Big Data Application Engineering Research Center of Yunnan Provincial Department of Education, Dali University, Dali, 671003, China. zhouchichun@dali.edu.cn.
Interdiscip Sci ; 2024 Aug 07.
Article em En | MEDLINE | ID: mdl-39110340
ABSTRACT
Promoters are important cis-regulatory elements for the regulation of gene expression, and their accurate predictions are crucial for elucidating the biological functions and potential mechanisms of genes. Many previous prokaryotic promoter prediction methods are encouraging in terms of the prediction performance, but most of them focus on the recognition of promoters in only one or a few bacterial species. Moreover, due to ignoring the promoter sequence motifs, the interpretability of predictions with existing methods is limited. In this work, we present a generalized method Prompt (Promoters in multiple prokaryotes) to predict promoters in 16 prokaryotes and improve the interpretability of prediction results. Prompt integrates three methods including RSK (Regression based on Selected k-mer), CL (Contrastive Learning) and MLP (Multilayer Perception), and employs a voting strategy to divide the datasets into high-confidence and low-confidence categories. Results on the promoter prediction tasks in 16 prokaryotes show that the accuracy (Accuracy, Matthews correlation coefficient) of Prompt is greater than 80% in highly credible datasets of 16 prokaryotes, and is greater than 90% in 12 prokaryotes, and Prompt performs the best compared with other existing methods. Moreover, by identifying promoter sequence motifs, Prompt can improve the interpretability of the predictions. Prompt is freely available at https//github.com/duqimeng/PromptPrompt , and will contribute to the research of promoters in prokaryote.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Interdiscip Sci Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Interdiscip Sci Ano de publicação: 2024 Tipo de documento: Article