Splice sites detection using chaos game representation and neural network.
Genomics
; 112(2): 1847-1852, 2020 03.
Article
en En
| MEDLINE
| ID: mdl-31704313
A novel method is proposed to detect the acceptor and donor splice sites using chaos game representation and artificial neural network. In order to achieve high accuracy, inputs to the neural network, or feature vector, shall reflect the true nature of the DNA segments. Therefore it is important to have one-to-one numerical representation, i.e. a feature vector should be able to represent the original data. Chaos game representation (CGR) is an iterative mapping technique that assigns each nucleotide in a DNA sequence to a respective position on the plane in a one-to-one manner. Using CGR, a DNA sequence can be mapped to a numerical sequence that reflects the true nature of the original sequence. In this research, we propose to use CGR as feature input to a neural network to detect splice sites on the NN269 dataset. Computational experiments indicate that this approach gives good accuracy while being simpler than other methods in the literature, with only one neural network component. The code and data for our method can be accessed from this link: https://github.com/thoang3/portfolio/tree/SpliceSites_ANN_CGR.
Palabras clave
Texto completo:
1
Bases de datos:
MEDLINE
Asunto principal:
Redes Neurales de la Computación
/
Análisis de Secuencia de ADN
/
Sitios de Empalme de ARN
Tipo de estudio:
Diagnostic_studies
Límite:
Humans
Idioma:
En
Revista:
Genomics
Asunto de la revista:
GENETICA
Año:
2020
Tipo del documento:
Article
País de afiliación:
Estados Unidos