Your browser doesn't support javascript.
loading
Splice sites detection using chaos game representation and neural network.
Hoang, Tung; Yin, Changchuan; Yau, Stephen S-T.
Afiliación
  • Hoang T; Department of Mathematics, Statistics, and Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA.
  • Yin C; Department of Mathematics, Statistics, and Computer Science, University of Illinois at Chicago, Chicago, IL 60607, USA.
  • Yau SS; Department of Mathematical Sciences, Tsinghua University, Beijing 100084, P.R. China. Electronic address: yau@uic.edu.
Genomics ; 112(2): 1847-1852, 2020 03.
Article en En | MEDLINE | ID: mdl-31704313
A novel method is proposed to detect the acceptor and donor splice sites using chaos game representation and artificial neural network. In order to achieve high accuracy, inputs to the neural network, or feature vector, shall reflect the true nature of the DNA segments. Therefore it is important to have one-to-one numerical representation, i.e. a feature vector should be able to represent the original data. Chaos game representation (CGR) is an iterative mapping technique that assigns each nucleotide in a DNA sequence to a respective position on the plane in a one-to-one manner. Using CGR, a DNA sequence can be mapped to a numerical sequence that reflects the true nature of the original sequence. In this research, we propose to use CGR as feature input to a neural network to detect splice sites on the NN269 dataset. Computational experiments indicate that this approach gives good accuracy while being simpler than other methods in the literature, with only one neural network component. The code and data for our method can be accessed from this link: https://github.com/thoang3/portfolio/tree/SpliceSites_ANN_CGR.
Asunto(s)
Palabras clave

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Redes Neurales de la Computación / Análisis de Secuencia de ADN / Sitios de Empalme de ARN Tipo de estudio: Diagnostic_studies Límite: Humans Idioma: En Revista: Genomics Asunto de la revista: GENETICA Año: 2020 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Redes Neurales de la Computación / Análisis de Secuencia de ADN / Sitios de Empalme de ARN Tipo de estudio: Diagnostic_studies Límite: Humans Idioma: En Revista: Genomics Asunto de la revista: GENETICA Año: 2020 Tipo del documento: Article País de afiliación: Estados Unidos