Your browser doesn't support javascript.
Parallel continuous flow: a parallel suffix tree construction tool for whole genomes.
J Comput Biol ; 21(4): 330-44, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-24597675
The construction of suffix trees for very long sequences is essential for many applications, and it plays a central role in the bioinformatic domain. With the advent of modern sequencing technologies, biological sequence databases have grown dramatically. Also the methodologies required to analyze these data have become more complex everyday, requiring fast queries to multiple genomes. In this article, we present parallel continuous flow (PCF), a parallel suffix tree construction method that is suitable for very long genomes. We tested our method for the suffix tree construction of the entire human genome, about 3GB. We showed that PCF can scale gracefully as the size of the input genome grows. Our method can work with an efficiency of 90% with 36 processors and 55% with 172 processors. We can index the human genome in 7 minutes using 172 processes.





Texto completo: Disponível Coleções: Bases de dados internacionais Base de dados: MEDLINE Assunto principal: Software / Genoma Humano Limite: Humanos Idioma: Inglês Revista: J Comput Biol Assunto da revista: Biologia Molecular / Informática Médica Ano de publicação: 2014 Tipo de documento: Artigo País de afiliação: Itália