Your browser doesn't support javascript.
loading
Alignment-free distance measure based on return time distribution for sequence analysis: applications to clustering, molecular phylogeny and subtyping.
Kolekar, Pandurang; Kale, Mohan; Kulkarni-Kale, Urmila.
Afiliação
  • Kolekar P; Bioinformatics Centre, University of Pune, Pune 411 007, India. pandurang@bioinfo.net.in
Mol Phylogenet Evol ; 65(2): 510-22, 2012 Nov.
Article em En | MEDLINE | ID: mdl-22820020
ABSTRACT
The data deluge in post-genomic era demands development of novel data mining tools. Existing molecular phylogeny analyses (MPAs) developed for individual gene/protein sequences are alignment-based. However, the size of genomic data and uncertainties associated with alignments, necessitate development of alignment-free methods for MPA. Derivation of distances between sequences is an important step in both, alignment-dependant and alignment-free methods. Various alignment-free distance measures based on oligo-nucleotide frequencies, information content, compression techniques, etc. have been proposed. However, these distance measures do not account for relative order of components viz. nucleotides or amino acids. A new distance measure, based on the concept of 'return time distribution' (RTD) of k-mers is proposed, which accounts for the sequence composition and their relative orders. Statistical parameters of RTDs are used to derive a distance function. The resultant distance matrix is used for clustering and phylogeny using Neighbor-joining. Its performance for MPA and subtyping was evaluated using simulated data generated by block-bootstrap, receiver operating characteristics and leave-one-out cross validation methods. The proposed method was successfully applied for MPA of family Flaviviridae and subtyping of Dengue viruses. It is observed that method retains resolution for classification and subtyping of viruses at varying levels of sequence similarity and taxonomic hierarchy.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Filogenia / Análise de Sequência Idioma: En Revista: Mol Phylogenet Evol Assunto da revista: BIOLOGIA / BIOLOGIA MOLECULAR Ano de publicação: 2012 Tipo de documento: Article País de afiliação: Índia

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Filogenia / Análise de Sequência Idioma: En Revista: Mol Phylogenet Evol Assunto da revista: BIOLOGIA / BIOLOGIA MOLECULAR Ano de publicação: 2012 Tipo de documento: Article País de afiliação: Índia