Your browser doesn't support javascript.
loading
Enhancer Recognition: A Transformer Encoder-Based Method with WGAN-GP for Data Augmentation.
Feng, Tianyu; Hu, Tao; Liu, Wenyu; Zhang, Yang.
Afiliación
  • Feng T; College of Information Science & Engineering, Lanzhou University, Lanzhou 730000, China.
  • Hu T; College of Information Science & Engineering, Lanzhou University, Lanzhou 730000, China.
  • Liu W; College of Ecology, Lanzhou University, Lanzhou 730000, China.
  • Zhang Y; Supercomputer Center, Lanzhou University, Lanzhou 730000, China.
Int J Mol Sci ; 24(24)2023 Dec 16.
Article en En | MEDLINE | ID: mdl-38139375
ABSTRACT
Enhancers are located upstream or downstream of key deoxyribonucleic acid (DNA) sequences in genes and can adjust the transcription activity of neighboring genes. Identifying enhancers and determining their functions are important for understanding gene regulatory networks and expression regulatory mechanisms. However, traditional enhancer recognition relies on manual feature engineering, which is time-consuming and labor-intensive, making it difficult to perform large-scale recognition analysis. In addition, if the original dataset is too small, there is a risk of overfitting. In recent years, emerging methods, such as deep learning, have provided new insights for enhancing identification. However, these methods also present certain challenges. Deep learning models typically require a large amount of high-quality data, and data acquisition demands considerable time and resources. To address these challenges, in this paper, we propose a data-augmentation method based on generative adversarial networks to solve the problem of small datasets. Moreover, we used regularization methods such as weight decay to improve the generalizability of the model and alleviate overfitting. The Transformer encoder was used as the main component to capture the complex relationships and dependencies in enhancer sequences. The encoding layer was designed based on the principle of k-mers to preserve more information from the original DNA sequence. Compared with existing methods, the proposed approach made significant progress in enhancing the accuracy and strength of enhancer identification and prediction, demonstrating the effectiveness of the proposed method. This paper provides valuable insights for enhancer analysis and is of great significance for understanding gene regulatory mechanisms and studying disease correlations.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Trabajo de Parto / Redes Reguladoras de Genes Límite: Female / Humans / Pregnancy Idioma: En Revista: Int J Mol Sci Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Trabajo de Parto / Redes Reguladoras de Genes Límite: Female / Humans / Pregnancy Idioma: En Revista: Int J Mol Sci Año: 2023 Tipo del documento: Article País de afiliación: China