EnzymeNet: residual neural networks model for Enzyme Commission number prediction.
Bioinform Adv
; 3(1): vbad173, 2023.
Article
en En
| MEDLINE
| ID: mdl-38075476
Motivation: Enzymes are key targets to biosynthesize functional substances in metabolic engineering. Therefore, various machine learning models have been developed to predict Enzyme Commission (EC) numbers, one of the enzyme annotations. However, the previously reported models might predict the sequences with numerous consecutive identical amino acids, which are found within unannotated sequences, as enzymes. Results: Here, we propose EnzymeNet for prediction of complete EC numbers using residual neural networks. EnzymeNet can exclude the exceptional sequences described above. Several EnzymeNet models were built and optimized to explore the best conditions for removing such sequences. As a result, the models exhibited higher prediction accuracy with macro F1 score up to 0.850 than previously reported models. Moreover, even the enzyme sequences with low similarity to training data, which were difficult to predict using the reported models, could be predicted extensively using EnzymeNet models. The robustness of EnzymeNet models will lead to discover novel enzymes for biosynthesis of functional compounds using microorganisms. Availability and implementation: The source code of EnzymeNet models is freely available at https://github.com/nwatanbe/enzymenet.
Texto completo:
1
Colección:
01-internacional
Banco de datos:
MEDLINE
Idioma:
En
Revista:
Bioinform Adv
Año:
2023
Tipo del documento:
Article
País de afiliación:
Japón