Your browser doesn't support javascript.
loading
Multiple sequence alignment based on deep reinforcement learning with self-attention and positional encoding.
Liu, Yuhang; Yuan, Hao; Zhang, Qiang; Wang, Zixuan; Xiong, Shuwen; Wen, Naifeng; Zhang, Yongqing.
Afiliación
  • Liu Y; School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China.
  • Yuan H; School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China.
  • Zhang Q; School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China.
  • Wang Z; College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China.
  • Xiong S; School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China.
  • Wen N; School of Mechanical and Electrical Engineering, Dalian Minzu University, Dalian 116600, China.
  • Zhang Y; School of Computer Science, Chengdu University of Information Technology, Chengdu 610225, China.
Bioinformatics ; 39(11)2023 11 01.
Article en En | MEDLINE | ID: mdl-37856335
MOTIVATION: Multiple sequence alignment (MSA) is one of the hotspots of current research and is commonly used in sequence analysis scenarios. However, there is no lasting solution for MSA because it is a Nondeterministic Polynomially complete problem, and the existing methods still have room to improve the accuracy. RESULTS: We propose Deep reinforcement learning with Positional encoding and self-Attention for MSA, based on deep reinforcement learning, to enhance the accuracy of the alignment Specifically, inspired by the translation technique in natural language processing, we introduce self-attention and positional encoding to improve accuracy and reliability. Firstly, positional encoding encodes the position of the sequence to prevent the loss of nucleotide position information. Secondly, the self-attention model is used to extract the key features of the sequence. Then input the features into a multi-layer perceptron, which can calculate the insertion position of the gap according to the features. In addition, a novel reinforcement learning environment is designed to convert the classic progressive alignment into progressive column alignment, gradually generating each column's sub-alignment. Finally, merge the sub-alignment into the complete alignment. Extensive experiments based on several datasets validate our method's effectiveness for MSA, outperforming some state-of-the-art methods in terms of the Sum-of-pairs and Column scores. AVAILABILITY AND IMPLEMENTATION: The process is implemented in Python and available as open-source software from https://github.com/ZhangLab312/DPAMSA.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Algoritmos / Programas Informáticos Idioma: En Año: 2023 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Algoritmos / Programas Informáticos Idioma: En Año: 2023 Tipo del documento: Article