Your browser doesn't support javascript.
loading
CircSSNN: circRNA-binding site prediction via sequence self-attention neural networks with pre-normalization.
Cao, Chao; Yang, Shuhong; Li, Mengli; Li, Chungui.
Afiliação
  • Cao C; School of Computer Science and Technology, Guangxi University of Science and Technology, Liuzhou, China.
  • Yang S; Key Laboratory of Guangxi Universities on Intelligent Computing and Distributed Information Processing, Guangxi University of Science and Technology, Liuzhou, China. ysh@hzu.edu.cn.
  • Li M; School of Technology, Guilin University, Guilin, China.
  • Li C; School of Computer Science and Technology, Guangxi University of Science and Technology, Liuzhou, China. liza4323@163.com.
BMC Bioinformatics ; 24(1): 220, 2023 May 30.
Article em En | MEDLINE | ID: mdl-37254080
ABSTRACT

BACKGROUND:

Circular RNAs (circRNAs) play a significant role in some diseases by acting as transcription templates. Therefore, analyzing the interaction mechanism between circRNA and RNA-binding proteins (RBPs) has far-reaching implications for the prevention and treatment of diseases. Existing models for circRNA-RBP identification usually adopt convolution neural network (CNN), recurrent neural network (RNN), or their variants as feature extractors. Most of them have drawbacks such as poor parallelism, insufficient stability, and inability to capture long-term dependencies.

METHODS:

In this paper, we propose a new method completely using the self-attention mechanism to capture deep semantic features of RNA sequences. On this basis, we construct a CircSSNN model for the cirRNA-RBP identification. The proposed model constructs a feature scheme by fusing circRNA sequence representations with statistical distributions, static local contexts, and dynamic global contexts. With a stable and efficient network architecture, the distance between any two positions in a sequence is reduced to a constant, so CircSSNN can quickly capture the long-term dependencies and extract the deep semantic features.

RESULTS:

Experiments on 37 circRNA datasets show that the proposed model has overall advantages in stability, parallelism, and prediction performance. Keeping the network structure and hyperparameters unchanged, we directly apply the CircSSNN to linRNA datasets. The favorable results show that CircSSNN can be transformed simply and efficiently without task-oriented tuning.

CONCLUSIONS:

In conclusion, CircSSNN can serve as an appealing circRNA-RBP identification tool with good identification performance, excellent scalability, and wide application scope without the need for task-oriented fine-tuning of parameters, which is expected to reduce the professional threshold required for hyperparameter tuning in bioinformatics analysis.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / RNA Circular Idioma: En Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / RNA Circular Idioma: En Ano de publicação: 2023 Tipo de documento: Article País de afiliação: China