Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
J Proteome Res ; 21(8): 2023-2035, 2022 08 05.
Artigo em Inglês | MEDLINE | ID: mdl-35793793

RESUMO

Metaproteomics has been increasingly utilized for high-throughput characterization of proteins in complex environments and has been demonstrated to provide insights into microbial composition and functional roles. However, significant challenges remain in metaproteomic data analysis, including creation of a sample-specific protein sequence database. A well-matched database is a requirement for successful metaproteomics analysis, and the accuracy and sensitivity of PSM identification algorithms suffer when the database is incomplete or contains extraneous sequences. When matched DNA sequencing data of the sample is unavailable or incomplete, creating the proteome database that accurately represents the organisms in the sample is a challenge. Here, we leverage a de novo peptide sequencing approach to identify the sample composition directly from metaproteomic data. First, we created a deep learning model, Kaiko, to predict the peptide sequences from mass spectrometry data and trained it on 5 million peptide-spectrum matches from 55 phylogenetically diverse bacteria. After training, Kaiko successfully identified organisms from soil isolates and synthetic communities directly from proteomics data. Finally, we created a pipeline for metaproteome database generation using Kaiko. We tested the pipeline on native soils collected in Kansas, showing that the de novo sequencing model can be employed as an alternative and complementary method to construct the sample-specific protein database instead of relying on (un)matched metagenomes. Our pipeline identified all highly abundant taxa from 16S rRNA sequencing of the soil samples and uncovered several additional species which were strongly represented only in proteomic data.


Assuntos
Microbiota , Proteômica , Microbiota/genética , Peptídeos/análise , Peptídeos/genética , Proteoma/genética , Proteômica/métodos , RNA Ribossômico 16S/genética , Solo
2.
Anal Chem ; 91(21): 13372-13376, 2019 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-31596564

RESUMO

Ricin, a toxic protein from the castor plant, is of forensic and biosecurity interest because of its high toxicity and common occurrence in crimes and attempted crimes. Qualitative methods to detect ricin are therefore needed. Untargeted liquid chromatography-tandem mass spectrometry (LC-MS/MS) proteomics methods are well suited because of their high specificity. Specificity in LC-MS/MS comes from both the LC and MS components. However, modern untargeted proteomics methods often use nanoflow LC, which has less reproducible retention times than standard-flow LC, making it challenging to use retention time as a point of identification in a forensic assay. We address this challenge by using retention times relative to a standard, namely, the uniformly 15N-labeled ricin A-chain produced recombinantly in a bacterial expression system. This material, added as an internal standard prior to trypsin digestion, produces a stable-isotope-labeled standard for every ricin tryptic peptide in the sample. We show that the MS signals for 15N and natural isotopic abundance ricin peptides are distinct, with mass shifts that correspond to the numbers of nitrogen atoms in each peptide or fragment. We also show that, as expected, labeled and unlabeled peptides coelute, with relative retention time differences of less than 0.2%.


Assuntos
Cromatografia Líquida/métodos , Ciências Forenses/métodos , Marcação por Isótopo , Nanotecnologia/métodos , Ricina/química , Espectrometria de Massas em Tandem/métodos , Isótopos de Nitrogênio , Proteínas Recombinantes
3.
Anal Chem ; 91(19): 12399-12406, 2019 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-31490662

RESUMO

Robust and highly specific methods for the detection of the protein toxin ricin are of interest to the law enforcement community. In previous studies, methods based on liquid chromatography-tandem mass spectrometry shotgun proteomics have been proposed. The successful implementation of this approach relies on specific data evaluation criteria addressing (1) the quality of the mass spectrometric data, (2) the confidence of peptide identifications (peptide-spectrum matches), and (3) the number and sequence specificity of peptides detected. We present such data evaluation criteria and use a novel approach to establish the limit of detection for this ricin assay. Specifically, we use logistic regression to determine the probability of detection for individual ricin peptides at different concentrations. We then apply basic rules from probability theory, combining these individual peptide probabilities into an overall assay limit of detection. This procedure yields an assay limit of detection for ricin at 42.5 ng on column or 21.25 ng/µL for a 2-µL injection. We also show that, despite the conventional wisdom that detergents are deleterious to mass spectrometric analyses, the presence of Tween-20 did not prevent detection of ricin peptides, and indeed assays performed in buffers that included Tween-20 gave better results than assays performed using other buffer formulations with or without detergent removal.


Assuntos
Limite de Detecção , Proteômica/métodos , Ricina/análise , Sequência de Aminoácidos , Polissorbatos/química , Ricina/química , Ricina/metabolismo
4.
J Proteome Res ; 17(9): 3075-3085, 2018 09 07.
Artigo em Inglês | MEDLINE | ID: mdl-30109807

RESUMO

Bottom-up proteomics is increasingly being used to characterize unknown environmental, clinical, and forensic samples. Proteomics-based bacterial identification typically proceeds by tabulating peptide "hits" (i.e., confidently identified peptides) associated with the organisms in a database; those organisms with enough hits are declared present in the sample. This approach has proven to be successful in laboratory studies; however, important research gaps remain. First, the common-practice reliance on unique peptides for identification is susceptible to a phenomenon known as signal erosion. Second, no general guidelines are available for determining how many hits are needed to make a confident identification. These gaps inhibit the transition of this approach to real-world forensic samples where conditions vary and large databases may be needed. In this work, we propose statistical criteria that overcome the problem of signal erosion and can be applied regardless of the sample quality or data analysis pipeline. These criteria are straightforward, producing a p-value on the result of an organism or toxin identification. We test the proposed criteria on 919 LC-MS/MS data sets originating from 2 toxins and 32 bacterial strains acquired using multiple data collection platforms. Results reveal a > 95% correct species-level identification rate, demonstrating the effectiveness and robustness of proteomics-based organism/toxin identification.


Assuntos
Toxinas Bacterianas/isolamento & purificação , Ciências Forenses/métodos , Peptídeos/análise , Proteômica/estatística & dados numéricos , Bacillus/química , Bacillus/patogenicidade , Bacillus/fisiologia , Toxinas Bacterianas/química , Cromatografia Líquida , Clostridium/química , Clostridium/patogenicidade , Clostridium/fisiologia , Interpretação Estatística de Dados , Desulfovibrio/química , Desulfovibrio/patogenicidade , Desulfovibrio/fisiologia , Escherichia/química , Escherichia/patogenicidade , Escherichia/fisiologia , Ciências Forenses/instrumentação , Ciências Forenses/estatística & dados numéricos , Humanos , Peptídeos/química , Probabilidade , Proteômica/métodos , Pseudomonas/química , Pseudomonas/patogenicidade , Pseudomonas/fisiologia , Salmonella/química , Salmonella/patogenicidade , Salmonella/fisiologia , Sensibilidade e Especificidade , Shewanella/química , Shewanella/patogenicidade , Shewanella/fisiologia , Espectrometria de Massas em Tandem , Yersinia/química , Yersinia/patogenicidade , Yersinia/fisiologia
5.
Front Med (Lausanne) ; 9: 821071, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35223919

RESUMO

Antimicrobial resistance (AMR) is a well-recognized, widespread, and growing issue of concern. With increasing incidence of AMR, the ability to respond quickly to infection with or exposure to an AMR pathogen is critical. Approaches that could accurately and more quickly identify whether a pathogen is AMR also are needed to more rapidly respond to existing and emerging biological threats. We examined proteins associated with paired AMR and antimicrobial susceptible (AMS) strains of Yersinia pestis and Francisella tularensis, causative agents of the diseases plague and tularemia, respectively, to identify whether potential existed to use proteins as signatures of AMR. We found that protein expression was significantly impacted by AMR status. Antimicrobial resistance-conferring proteins were expressed even in the absence of antibiotics in growth media, and the abundance of 10-20% of cellular proteins beyond those that directly confer AMR also were significantly changed in both Y. pestis and F. tularensis. Most strikingly, the abundance of proteins involved in specific metabolic pathways and biological functions was altered in all AMR strains examined, independent of species, resistance mechanism, and affected cellular antimicrobial target. We have identified features that distinguish between AMR and AMS strains, including a subset of features shared across species with different resistance mechanisms, which suggest shared biological signatures of resistance. These features could form the basis of novel approaches to identify AMR phenotypes in unknown strains.

6.
Protein Sci ; 29(9): 1864-1878, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32713088

RESUMO

Mass spectrometry-based proteomics is a popular and powerful method for precise and highly multiplexed protein identification. The most common method of analyzing untargeted proteomics data is called database searching, where the database is simply a collection of protein sequences from the target organism, derived from genome sequencing. Experimental peptide tandem mass spectra are compared to simplified models of theoretical spectra calculated from the translated genomic sequences. However, in several interesting application areas, such as forensics, archaeology, venomics, and others, a genome sequence may not be available, or the correct genome sequence to use is not known. In these cases, de novo peptide identification can play an important role. De novo methods infer peptide sequence directly from the tandem mass spectrum without reference to a sequence database, usually using graph-based or machine learning algorithms. In this review, we provide a basic overview of de novo peptide identification methods and applications, briefly covering de novo algorithms and tools, and focusing in more depth on recent applications from venomics, metaproteomics, forensics, and characterization of antibody drugs.


Assuntos
Bases de Dados de Proteínas , Peptídeos/análise , Espectrometria de Massas em Tandem
7.
Toxicon ; 140: 18-31, 2017 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-29031940

RESUMO

The toxic protein ricin (also known as RCA60), found in the seed of the castor plant (Ricinus communis) is frequently encountered in law enforcement investigations. The ability to detect ricin by analyzing its proteolytic (tryptic) peptides by liquid chromatography-tandem mass spectrometry (LC-MS/MS) is well established. However, ricin is just one member of a family of proteins in R. communis with closely related amino acid sequences, including R. communis agglutinin I (RCA120) and other ricin-like proteins (RLPs). Inferring the presence of ricin from its constituent peptides requires an understanding of the specificity, or uniqueness to ricin, of each peptide. Here we describe the set of ricin-derived tryptic peptides that can serve to uniquely identify ricin in distinction to closely-related RLPs and to proteins from other species. Other ricin-derived peptide sequences occur only in the castor plant, and still others are shared with unrelated species. We also characterized the occurrence and relative abundance of ricin and related proteins in an assortment of forensically relevant crude castor seed preparations. We find that whereas ricin and RCA120 are abundant in castor seed extracts, other RLPs are not represented by abundant unique peptides. Therefore, the detection of peptides shared between ricin and RLPs (other than RCA120) in crude castor seed extracts most likely reflects the presence of ricin in the sample.


Assuntos
Substâncias para a Guerra Química/análise , Ricina/análise , Ricinus communis/química , Sequência de Aminoácidos , Substâncias para a Guerra Química/química , Cromatografia Líquida , Peptídeos/análise , Extratos Vegetais/química , Proteínas de Plantas/análise , Ricina/química , Sementes/química , Espectrometria de Massas em Tandem
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA