On selecting features from splice junctions: an analysis using information theoretic and machine learning approaches.
Genome Inform
; 14: 73-83, 2003.
Article
em En
| MEDLINE
| ID: mdl-15706522
ABSTRACT
The computational recognition of precise splice junctions is a challenge faced in the analysis of newly sequenced genomes. This is challenging due to the fact that the distribution of sequence patterns in these regions is not always distinct. Our objective is to understand the sequence signatures at the splice junctions, not simply to create an artificial recognition system. We use a combination of a neural network based calliper randomization approach and an information theoretic based feature selection approach for this purpose. This has been done in an effort to understand regions that harbor information content and to extract features relevant for the prediction of splice junctions. The analysis using the neural network based calliper randomization approach revealed regions important in the internal representation of the network model. The calliper approach captured both correlated as well as independently important features. The feature selection approach captures features that are independently informative. The two different methods can capture features with different properties. Comparative analysis of the results using both the methods help to infer about the kind of information present in the region.
Buscar no Google
Base de dados:
MEDLINE
Assunto principal:
Inteligência Artificial
/
Processamento Alternativo
Tipo de estudo:
Clinical_trials
/
Prognostic_studies
Idioma:
En
Ano de publicação:
2003
Tipo de documento:
Article