Your browser doesn't support javascript.
loading
A deep learning model for DNA enhancer prediction based on nucleotide position aware feature encoding.
Hu, Wenxing; Li, Yelin; Wu, Yan; Guan, Lixin; Li, Mengshan.
Afiliación
  • Hu W; College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China.
  • Li Y; College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China.
  • Wu Y; College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China.
  • Guan L; College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China.
  • Li M; College of Physics and Electronic Information, Gannan Normal University, Ganzhou 341000, Jiangxi, China.
iScience ; 27(6): 110030, 2024 Jun 21.
Article en En | MEDLINE | ID: mdl-38868182
ABSTRACT
Enhancers, genomic DNA elements, regulate neighboring gene expression crucial for biological processes like cell differentiation and stress response. However, current machine learning methods for predicting DNA enhancers often underutilize hidden features in gene sequences, limiting model accuracy. Hence, this article proposes the PDCNN model, a deep learning-based enhancer prediction method. PDCNN extracts statistical nucleotide representations from gene sequences, discerning positional distribution information of nucleotides in modifier-like DNA sequences. With a convolutional neural network structure, PDCNN employs dual convolutional and fully connected layers. The cross-entropy loss function iteratively updates using a gradient descent algorithm, enhancing prediction accuracy. Model parameters are fine-tuned to select optimal combinations for training, achieving over 95% accuracy. Comparative analysis with traditional methods and existing models demonstrates PDCNN's robust feature extraction capability. It outperforms advanced machine learning methods in identifying DNA enhancers, presenting an effective method with broad implications for genomics, biology, and medical research.
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Idioma: En Revista: IScience Año: 2024 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Idioma: En Revista: IScience Año: 2024 Tipo del documento: Article País de afiliación: China