Your browser doesn't support javascript.
loading
Mistle: bringing spectral library predictions to metaproteomics with an efficient search index.
Nowatzky, Yannek; Benner, Philipp; Reinert, Knut; Muth, Thilo.
Afiliación
  • Nowatzky Y; Section S.3 eScience, Federal Institute for Materials Research and Testing (BAM), Berlin 12205, Germany.
  • Benner P; Section S.3 eScience, Federal Institute for Materials Research and Testing (BAM), Berlin 12205, Germany.
  • Reinert K; Department of Mathematics and Computer Science, FU Berlin, Berlin 14195, Germany.
  • Muth T; Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin 14195, Germany.
Bioinformatics ; 39(6)2023 06 01.
Article en En | MEDLINE | ID: mdl-37294786
ABSTRACT
MOTIVATION Deep learning has moved to the forefront of tandem mass spectrometry-driven proteomics and authentic prediction for peptide fragmentation is more feasible than ever. Still, at this point spectral prediction is mainly used to validate database search results or for confined search spaces. Fully predicted spectral libraries have not yet been efficiently adapted to large search space problems that often occur in metaproteomics or proteogenomics.

RESULTS:

In this study, we showcase a workflow that uses Prosit for spectral library predictions on two common metaproteomes and implement an indexing and search algorithm, Mistle, to efficiently identify experimental mass spectra within the library. Hence, the workflow emulates a classic protein sequence database search with protein digestion but builds a searchable index from spectral predictions as an in-between step. We compare Mistle to popular search engines, both on a spectral and database search level, and provide evidence that this approach is more accurate than a database search using MSFragger. Mistle outperforms other spectral library search engines in terms of run time and proves to be extremely memory efficient with a 4- to 22-fold decrease in RAM usage. This makes Mistle universally applicable to large search spaces, e.g. covering comprehensive sequence databases of diverse microbiomes. AVAILABILITY AND IMPLEMENTATION Mistle is freely available on GitHub at https//github.com/BAMeScience/Mistle.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Péptidos / Programas Informáticos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Péptidos / Programas Informáticos Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: Alemania