Pesquisa | Portal Regional da BVS

MapReduce implementation of a hybrid spectral library-database search method for large-scale peptide identification.

Kalyanaraman, Ananth; Cannon, William R; Latt, Benjamin; Baxter, Douglas J.

Bioinformatics ; 27(21): 3072-3, 2011 Nov 01.

Artigo em Inglês | MEDLINE | ID: mdl-21926122

RESUMO

SUMMARY: A MapReduce-based implementation called MR-MSPolygraph for parallelizing peptide identification from mass spectrometry data is presented. The underlying serial method, MSPolygraph, uses a novel hybrid approach to match an experimental spectrum against a combination of a protein sequence database and a spectral library. Our MapReduce implementation can run on any Hadoop cluster environment. Experimental results demonstrate that, relative to the serial version, MR-MSPolygraph reduces the time to solution from weeks to hours, for processing tens of thousands of experimental spectra. Speedup and other related performance studies are also reported on a 400-core Hadoop cluster using spectral datasets from environmental microbial communities as inputs. AVAILABILITY: The source code along with user documentation are available on http://compbio.eecs.wsu.edu/MR-MSPolygraph. CONTACT: ananth@eecs.wsu.edu; william.cannon@pnnl.gov. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Espectrometria de Massas/métodos , Peptídeos/química , Software , Bases de Dados de Proteínas , Análise de Sequência de Proteína

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA