An antibacterial peptides recognition method based on BERT and Text-CNN / 生物工程学报
Chinese Journal of Biotechnology
; (12): 1815-1824, 2023.
Article
de Zh
| WPRIM
| ID: wpr-981172
Bibliothèque responsable:
WPRO
ABSTRACT
Antimicrobial peptides (AMPs) are small molecule peptides that are widely found in living organisms with broad-spectrum antibacterial activity and immunomodulatory effect. Due to slower emergence of resistance, excellent clinical potential and wide range of application, AMP is a strong alternative to conventional antibiotics. AMP recognition is a significant direction in the field of AMP research. The high cost, low efficiency and long period shortcomings of the wet experiment methods prevent it from meeting the need for the large-scale AMP recognition. Therefore, computer-aided identification methods are important supplements to AMP recognition approaches, and one of the key issues is how to improve the accuracy. Protein sequences could be approximated as a language composed of amino acids. Consequently, rich features may be extracted using natural language processing (NLP) techniques. In this paper, we combine the pre-trained model BERT and the fine-tuned structure Text-CNN in the field of NLP to model protein languages, develop an open-source available antimicrobial peptide recognition tool and conduct a comparison with other five published tools. The experimental results show that the optimization of the two-phase training approach brings an overall improvement in accuracy, sensitivity, specificity, and Matthew correlation coefficient, offering a novel approach for further research on AMP recognition.
Mots clés
Texte intégral:
1
Indice:
WPRIM
Sujet Principal:
Traitement du langage naturel
/
Séquence d'acides aminés
/
Peptides antimicrobiens cationiques
/
Peptides antimicrobiens
/
Antibactériens
langue:
Zh
Texte intégral:
Chinese Journal of Biotechnology
Année:
2023
Type:
Article