Pesquisa | Biblioteca Virtual em Saúde

1.

The SysteMHC Atlas v2.0, an updated resource for mass spectrometry-based immunopeptidomics.

Huang, Xiaoxiang; Gan, Ziao; Cui, Haowei; Lan, Tian; Liu, Yansheng; Caron, Etienne; Shao, Wenguang.

Nucleic Acids Res ; 52(D1): D1062-D1071, 2024 Jan 05.

Artigo em Inglês | MEDLINE | ID: mdl-38000392

RESUMO

The SysteMHC Atlas v1.0 was the first public repository dedicated to mass spectrometry-based immunopeptidomics. Here we introduce a newly released version of the SysteMHC Atlas v2.0 (https://systemhc.sjtu.edu.cn), a comprehensive collection of 7190 MS files from 303 allotypes. We extended and optimized a computational pipeline that allows the identification of MHC-bound peptides carrying on unexpected post-translational modifications (PTMs), thereby resulting in 471K modified peptides identified over 60 distinct PTM types. In total, we identified approximately 1.0 million and 1.1 million unique peptides for MHC class I and class II immunopeptidomes, respectively, indicating a 6.8-fold increase and a 28-fold increase to those in v1.0. The SysteMHC Atlas v2.0 introduces several new features, including the inclusion of non-UniProt peptides, and the incorporation of several novel computational tools for FDR estimation, binding affinity prediction and motif deconvolution. Additionally, we enhanced the user interface, upgraded website framework, and provided external links to other resources related. Finally, we built and provided various spectral libraries as community resources for data mining and future immunopeptidomic and proteomic analysis. We believe that the SysteMHC Atlas v2.0 is a unique resource to provide key insights to the immunology and proteomics community and will accelerate the development of vaccines and immunotherapies.

Assuntos

Bases de Dados de Proteínas , Peptídeos , Proteômica , Espectrometria de Massas , Peptídeos/química , Peptídeos/imunologia , Processamento de Proteína Pós-Traducional , Proteômica/métodos , Bases de Dados de Proteínas/normas , Internet , Humanos , Animais

2.

Proteomic-based stratification of intermediate-risk prostate cancer patients.

Zhong, Qing; Sun, Rui; Aref, Adel T; Noor, Zainab; Anees, Asim; Zhu, Yi; Lucas, Natasha; Poulos, Rebecca C; Lyu, Mengge; Zhu, Tiansheng; Chen, Guo-Bo; Wang, Yingrui; Ding, Xuan; Rutishauser, Dorothea; Rupp, Niels J; Rueschoff, Jan H; Poyet, Cédric; Hermanns, Thomas; Fankhauser, Christian; Rodríguez Martínez, María; Shao, Wenguang; Buljan, Marija; Neumann, Janis Frederick; Beyer, Andreas; Hains, Peter G; Reddel, Roger R; Robinson, Phillip J; Aebersold, Ruedi; Guo, Tiannan; Wild, Peter J.

Life Sci Alliance ; 7(2)2024 02.

Artigo em Inglês | MEDLINE | ID: mdl-38052461

RESUMO

Gleason grading is an important prognostic indicator for prostate adenocarcinoma and is crucial for patient treatment decisions. However, intermediate-risk patients diagnosed in the Gleason grade group (GG) 2 and GG3 can harbour either aggressive or non-aggressive disease, resulting in under- or overtreatment of a significant number of patients. Here, we performed proteomic, differential expression, machine learning, and survival analyses for 1,348 matched tumour and benign sample runs from 278 patients. Three proteins (F5, TMEM126B, and EARS2) were identified as candidate biomarkers in patients with biochemical recurrence. Multivariate Cox regression yielded 18 proteins, from which a risk score was constructed to dichotomize prostate cancer patients into low- and high-risk groups. This 18-protein signature is prognostic for the risk of biochemical recurrence and completely independent of the intermediate GG. Our results suggest that markers generated by computational proteomic profiling have the potential for clinical applications including integration into prostate cancer management.

Assuntos

Neoplasias da Próstata , Proteômica , Masculino , Humanos , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Fatores de Risco , Gradação de Tumores

3.

A computational framework for the inference of protein complex remodeling from whole-proteome measurements.

Buljan, Marija; Banaei-Esfahani, Amir; Blattmann, Peter; Meier-Abt, Fabienne; Shao, Wenguang; Vitek, Olga; Tang, Hua; Aebersold, Ruedi.

Nat Methods ; 20(10): 1523-1529, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-37749212

RESUMO

Protein complexes are responsible for the enactment of most cellular functions. For the protein complex to form and function, its subunits often need to be present at defined quantitative ratios. Typically, global changes in protein complex composition are assessed with experimental approaches that tend to be time consuming. Here, we have developed a computational algorithm for the detection of altered protein complexes based on the systematic assessment of subunit ratios from quantitative proteomic measurements. We applied it to measurements from breast cancer cell lines and patient biopsies and were able to identify strong remodeling of HDAC2 epigenetic complexes in more aggressive forms of cancer. The presented algorithm is available as an R package and enables the inference of changes in protein complex states by extracting functionally relevant information from bottom-up proteomic datasets.

Assuntos

Proteoma , Proteômica , Humanos , Proteoma/metabolismo , Algoritmos , Células MCF-7 , Biologia Computacional

4.

Integrated multi-omics reveals anaplerotic rewiring in methylmalonyl-CoA mutase deficiency.

Forny, Patrick; Bonilla, Ximena; Lamparter, David; Shao, Wenguang; Plessl, Tanja; Frei, Caroline; Bingisser, Anna; Goetze, Sandra; van Drogen, Audrey; Harshman, Keith; Pedrioli, Patrick G A; Howald, Cedric; Poms, Martin; Traversi, Florian; Bürer, Céline; Cherkaoui, Sarah; Morscher, Raphael J; Simmons, Luke; Forny, Merima; Xenarios, Ioannis; Aebersold, Ruedi; Zamboni, Nicola; Rätsch, Gunnar; Dermitzakis, Emmanouil T; Wollscheid, Bernd; Baumgartner, Matthias R; Froese, D Sean.

Nat Metab ; 5(1): 80-95, 2023 01.

Artigo em Inglês | MEDLINE | ID: mdl-36717752

RESUMO

Methylmalonic aciduria (MMA) is an inborn error of metabolism with multiple monogenic causes and a poorly understood pathogenesis, leading to the absence of effective causal treatments. Here we employ multi-layered omics profiling combined with biochemical and clinical features of individuals with MMA to reveal a molecular diagnosis for 177 out of 210 (84%) cases, the majority (148) of whom display pathogenic variants in methylmalonyl-CoA mutase (MMUT). Stratification of these data layers by disease severity shows dysregulation of the tricarboxylic acid cycle and its replenishment (anaplerosis) by glutamine. The relevance of these disturbances is evidenced by multi-organ metabolomics of a hemizygous Mmut mouse model as well as through identification of physical interactions between MMUT and glutamine anaplerotic enzymes. Using stable-isotope tracing, we find that treatment with dimethyl-oxoglutarate restores deficient tricarboxylic acid cycling. Our work highlights glutamine anaplerosis as a potential therapeutic intervention point in MMA.

Assuntos

Erros Inatos do Metabolismo , Metilmalonil-CoA Mutase , Camundongos , Animais , Metilmalonil-CoA Mutase/genética , Metilmalonil-CoA Mutase/metabolismo , Glutamina , Multiômica , Erros Inatos do Metabolismo/genética

5.

PCfun: a hybrid computational framework for systematic characterization of protein complex function.

Sharma, Varun S; Fossati, Andrea; Ciuffa, Rodolfo; Buljan, Marija; Williams, Evan G; Chen, Zhen; Shao, Wenguang; Pedrioli, Patrick G A; Purcell, Anthony W; Martínez, María Rodríguez; Song, Jiangning; Manica, Matteo; Aebersold, Ruedi; Li, Chen.

Brief Bioinform ; 23(4)2022 07 18.

Artigo em Inglês | MEDLINE | ID: mdl-35724564

RESUMO

In molecular biology, it is a general assumption that the ensemble of expressed molecules, their activities and interactions determine biological function, cellular states and phenotypes. Stable protein complexes-or macromolecular machines-are, in turn, the key functional entities mediating and modulating most biological processes. Although identifying protein complexes and their subunit composition can now be done inexpensively and at scale, determining their function remains challenging and labor intensive. This study describes Protein Complex Function predictor (PCfun), the first computational framework for the systematic annotation of protein complex functions using Gene Ontology (GO) terms. PCfun is built upon a word embedding using natural language processing techniques based on 1 million open access PubMed Central articles. Specifically, PCfun leverages two approaches for accurately identifying protein complex function, including: (i) an unsupervised approach that obtains the nearest neighbor (NN) GO term word vectors for a protein complex query vector and (ii) a supervised approach using Random Forest (RF) models trained specifically for recovering the GO terms of protein complex queries described in the CORUM protein complex database. PCfun consolidates both approaches by performing a hypergeometric statistical test to enrich the top NN GO terms within the child terms of the GO terms predicted by the RF models. The documentation and implementation of the PCfun package are available at https://github.com/sharmavaruns/PCfun. We anticipate that PCfun will serve as a useful tool and novel paradigm for the large-scale characterization of protein complex function.

Assuntos

Biologia Computacional , Proteínas , Biologia Computacional/métodos , Bases de Dados de Proteínas , Ontologia Genética , Humanos , Processamento de Linguagem Natural

6.

Data-independent acquisition-based proteome and phosphoproteome profiling across six melanoma cell lines reveals determinants of proteotypes.

Gao, Erli; Li, Wenxue; Wu, Chongde; Shao, Wenguang; Di, Yi; Liu, Yansheng.

Mol Omics ; 17(3): 413-425, 2021 06 14.

Artigo em Inglês | MEDLINE | ID: mdl-33728422

RESUMO

Human cancer cell lines are widely used in pharmacological and systems biological studies. The rapid documentation of the steady-state gene expression landscape of the cells used in a particular experiment may help to improve the reproducibility of scientific research. Here we applied a data-independent acquisition mass spectrometry (DIA-MS) method, coupled with a peptide spectral-library-free data analysis workflow, to measure both the proteome and phosphoproteome of a melanoma cell line panel with different metastatic properties. For each cell line, the single-shot DIA-MS detected 8100 proteins and almost 40 000 phosphopeptides in the respective measurements of two hours. Benchmarking the DIA-MS data towards the RNA-seq data and tandem mass tag (TMT)-MS results from the same set of cell lines demonstrated comparable qualitative coverage and quantitative reproducibility. Our data confirmed the high but complex mRNA-protein and protein-phospsite correlations. The results successfully established DIA-MS as a strong and competitive proteotyping approach for cell lines. The data further showed that all subunits of the glycosylphosphatidylinositol (GPI)-anchor transamidase complex were overexpressed in metastatic melanoma cells and identified altered phosphoprotein modules such as the BAF complex and mRNA splicing between metastatic and primary cells. This study provides a high-quality resource for calibrating DIA-MS performance, benchmarking DIA bioinformatic algorithms, and exploring the metastatic proteotypes in melanoma cells.

Assuntos

Biologia Computacional/métodos , Melanoma/metabolismo , Fosfoproteínas/análise , Mapas de Interação de Proteínas , Proteômica/métodos , Linhagem Celular Tumoral , Cromatografia Líquida , Perfilação da Expressão Gênica , Humanos , Melanoma/genética , Metástase Neoplásica , Fosfoproteínas/genética , Análise de Sequência de RNA , Espectrometria de Massas em Tandem

7.

Differential Diagnosis of Preeclampsia Based on Urine Peptidome Features Revealed by High Resolution Mass Spectrometry.

Kononikhin, Alexey S; Zakharova, Natalia V; Sergeeva, Viktoria A; Indeykina, Maria I; Starodubtseva, Natalia L; Bugrova, Anna E; Muminova, Kamila T; Khodzhaeva, Zulfia S; Popov, Igor A; Shao, Wenguang; Pedrioli, Patrik; Shmakov, Roman G; Frankevich, Vladimir E; Sukhikh, Gennady T; Nikolaev, Evgeny N.

Diagnostics (Basel) ; 10(12)2020 Dec 03.

Artigo em Inglês | MEDLINE | ID: mdl-33287124

RESUMO

Preeclampsia (PE) is a severe pregnancy complication, which may be considered as a systemic response in the second half of pregnancy to physiological failures in the first trimester, and can lead to very serious consequences for the health of the mother and fetus. Since PE is often associated with proteinuria, urine proteomic assays may represent a powerful tool for timely diagnostics and appropriate management. High resolution mass spectrometry was applied for peptidome analysis of 127 urine samples of pregnant women with various hypertensive complications: normotensive controls (n = 17), chronic hypertension (n = 16), gestational hypertension (n = 15), mild PE (n = 25), severe PE (n = 25), and 29 patients with complicated diagnoses. Analysis revealed 3869 peptides, which mostly belong to 116 groups with overlapping sequences. A panel of 22 marker peptide groups reliably differentiating PE was created by multivariate statistics, and included 15 collagen groups (from COL1A1, COL3A1, COL2A1, COL4A4, COL5A1, and COL8A1), and single loci from alpha-1-antitrypsin, fibrinogen, membrane-associated progesterone receptor component 1, insulin, EMI domain-containing protein 1, lysine-specific demethylase 6B, and alpha-2-HS-glycoprotein each. ROC analysis of the created model resulted in 88% sensitivity, 96.8% specificity, and receiver operating characteristic curve (AUC) = 0.947. Obtained results confirm the high diagnostic potential of urinary peptidome profiling for pregnancy hypertensive disorders diagnostics.

8.

The SysteMHC Atlas: a Computational Pipeline, a Website, and a Data Repository for Immunopeptidomic Analyses.

Shao, Wenguang; Caron, Etienne; Pedrioli, Patrick; Aebersold, Ruedi.

Methods Mol Biol ; 2120: 173-181, 2020.

Artigo em Inglês | MEDLINE | ID: mdl-32124319

RESUMO

Mass spectrometry has emerged as the method of choice for the exploration of the immunopeptidome. Insights from the immunopeptidome promise novel cancer therapeutic approaches and a better understanding of the basic mechanisms of our immune system. To meet the computational demands from the steady gain in popularity and reach of mass spectrometry-based immunopeptidomics analysis, we created the SysteMHC Atlas project, a first-of-its-kind computational pipeline and resource repository dedicated to standardizing data analysis and public dissemination of immunopeptidomic datasets.

Assuntos

Antígenos HLA , Complexo Principal de Histocompatibilidade , Espectrometria de Massas/métodos , Proteômica/métodos , Alelos , Antígenos HLA/química , Antígenos HLA/genética , Antígenos HLA/imunologia , Humanos , Internet , Neoplasias/genética , Neoplasias/imunologia , Software

9.

Comparative analysis of mRNA and protein degradation in prostate tissues indicates high stability of proteins.

Shao, Wenguang; Guo, Tiannan; Toussaint, Nora C; Xue, Peng; Wagner, Ulrich; Li, Li; Charmpi, Konstantina; Zhu, Yi; Wu, Jianmin; Buljan, Marija; Sun, Rui; Rutishauser, Dorothea; Hermanns, Thomas; Fankhauser, Christian Daniel; Poyet, Cedric; Ljubicic, Jelena; Rupp, Niels; Rüschoff, Jan H; Zhong, Qing; Beyer, Andreas; Ji, Jiafu; Collins, Ben C; Liu, Yansheng; Rätsch, Gunnar; Wild, Peter J; Aebersold, Ruedi.

Nat Commun ; 10(1): 2524, 2019 06 07.

Artigo em Inglês | MEDLINE | ID: mdl-31175306

RESUMO

Deterioration of biomolecules in clinical tissues is an inevitable pre-analytical process, which affects molecular measurements and thus potentially confounds conclusions from cohort analyses. Here, we investigate the degradation of mRNA and protein in 68 pairs of adjacent prostate tissue samples using RNA-Seq and SWATH mass spectrometry, respectively. To objectively quantify the extent of protein degradation, we develop a numerical score, the Proteome Integrity Number (PIN), that faithfully measures the degree of protein degradation. Our results indicate that protein degradation only affects 5.9% of the samples tested and shows negligible correlation with mRNA degradation in the adjacent samples. These findings are confirmed by independent analyses on additional clinical sample cohorts and across different mass spectrometric methods. Overall, the data show that the majority of samples tested are not compromised by protein degradation, and establish the PIN score as a generic and accurate indicator of sample quality for proteomic analyses.

Assuntos

Próstata/metabolismo , Neoplasias da Próstata/metabolismo , Proteínas/metabolismo , Proteólise , Estabilidade de RNA , RNA Mensageiro/metabolismo , Idoso , Humanos , Masculino , Espectrometria de Massas , Pessoa de Meia-Idade , Análise de Sequência de RNA

10.

A tissue-based draft map of the murine MHC class I immunopeptidome.

Schuster, Heiko; Shao, Wenguang; Weiss, Tobias; Pedrioli, Patrick G A; Roth, Patrick; Weller, Michael; Campbell, David S; Deutsch, Eric W; Moritz, Robert L; Planz, Oliver; Rammensee, Hans-Georg; Aebersold, Ruedi; Caron, Etienne.

Sci Data ; 5: 180157, 2018 08 07.

Artigo em Inglês | MEDLINE | ID: mdl-30084848

RESUMO

The large array of peptides presented to CD8+ T cells by major histocompatibility complex (MHC) class I molecules is referred to as the MHC class I immunopeptidome. Although the MHC class I immunopeptidome is ubiquitous in mammals and represents a critical component of the immune system, very little is known, in any species, about its composition across most tissues and organs in vivo. We applied mass spectrometry (MS) technologies to draft the first tissue-based atlas of the murine MHC class I immunopeptidome in health. Peptides were extracted from 19 normal tissues from C57BL/6 mice and prepared for MS injections, resulting in a total number of 28,448 high-confidence H2Db/Kb-associated peptides identified and annotated in the atlas. This atlas provides initial qualitative data to explore the tissue-specificity of the immunopeptidome and serves as a guide to identify potential tumor-associated antigens from various cancer models. Our data were shared via PRIDE (PXD008733), SysteMHC Atlas (SYSMHC00018) and SWATH Atlas. We anticipate that this unique dataset will be expanded in the future and will find wide applications in basic and translational immunology.

Assuntos

Antígenos de Histocompatibilidade Classe I , Especificidade de Órgãos , Animais , Antígenos de Histocompatibilidade Classe I/análise , Antígenos de Histocompatibilidade Classe I/imunologia , Espectrometria de Massas , Camundongos , Camundongos Endogâmicos C57BL , Peptídeos

11.

The SysteMHC Atlas project.

Shao, Wenguang; Pedrioli, Patrick G A; Wolski, Witold; Scurtescu, Cristian; Schmid, Emanuel; Vizcaíno, Juan A; Courcelles, Mathieu; Schuster, Heiko; Kowalewski, Daniel; Marino, Fabio; Arlehamn, Cecilia S L; Vaughan, Kerrie; Peters, Bjoern; Sette, Alessandro; Ottenhoff, Tom H M; Meijgaarden, Krista E; Nieuwenhuizen, Natalie; Kaufmann, Stefan H E; Schlapbach, Ralph; Castle, John C; Nesvizhskii, Alexey I; Nielsen, Morten; Deutsch, Eric W; Campbell, David S; Moritz, Robert L; Zubarev, Roman A; Ytterberg, Anders Jimmy; Purcell, Anthony W; Marcilla, Miguel; Paradela, Alberto; Wang, Qi; Costello, Catherine E; Ternette, Nicola; van Veelen, Peter A; van Els, Cécile A C M; Heck, Albert J R; de Souza, Gustavo A; Sollid, Ludvig M; Admon, Arie; Stevanovic, Stefan; Rammensee, Hans-Georg; Thibault, Pierre; Perreault, Claude; Bassani-Sternberg, Michal; Aebersold, Ruedi; Caron, Etienne.

Nucleic Acids Res ; 46(D1): D1237-D1247, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-28985418

RESUMO

Mass spectrometry (MS)-based immunopeptidomics investigates the repertoire of peptides presented at the cell surface by major histocompatibility complex (MHC) molecules. The broad clinical relevance of MHC-associated peptides, e.g. in precision medicine, provides a strong rationale for the large-scale generation of immunopeptidomic datasets and recent developments in MS-based peptide analysis technologies now support the generation of the required data. Importantly, the availability of diverse immunopeptidomic datasets has resulted in an increasing need to standardize, store and exchange this type of data to enable better collaborations among researchers, to advance the field more efficiently and to establish quality measures required for the meaningful comparison of datasets. Here we present the SysteMHC Atlas (https://systemhcatlas.org), a public database that aims at collecting, organizing, sharing, visualizing and exploring immunopeptidomic data generated by MS. The Atlas includes raw mass spectrometer output files collected from several laboratories around the globe, a catalog of context-specific datasets of MHC class I and class II peptides, standardized MHC allele-specific peptide spectral libraries consisting of consensus spectra calculated from repeat measurements of the same peptide sequence, and links to other proteomics and immunology databases. The SysteMHC Atlas project was created and will be further expanded using a uniform and open computational pipeline that controls the quality of peptide identifications and peptide annotations. Thus, the SysteMHC Atlas disseminates quality controlled immunopeptidomic information to the public domain and serves as a community resource toward the generation of a high-quality comprehensive map of the human immunopeptidome and the support of consistent measurement of immunopeptidomic sample cohorts.

Assuntos

Bases de Dados Factuais , Antígenos HLA , Antígenos de Histocompatibilidade , Espectrometria de Massas , Alelos , Antígenos HLA/química , Antígenos HLA/imunologia , Antígenos de Histocompatibilidade/química , Antígenos de Histocompatibilidade/imunologia , Humanos , Internet , Espectrometria de Massas em Tandem , Interface Usuário-Computador

12.

Systematic proteome and proteostasis profiling in human Trisomy 21 fibroblast cells.

Liu, Yansheng; Borel, Christelle; Li, Li; Müller, Torsten; Williams, Evan G; Germain, Pierre-Luc; Buljan, Marija; Sajic, Tatjana; Boersema, Paul J; Shao, Wenguang; Faini, Marco; Testa, Giuseppe; Beyer, Andreas; Antonarakis, Stylianos E; Aebersold, Ruedi.

Nat Commun ; 8(1): 1212, 2017 10 31.

Artigo em Inglês | MEDLINE | ID: mdl-29089484

RESUMO

Down syndrome (DS) is mostly caused by a trisomy of the entire Chromosome 21 (Trisomy 21, T21). Here, we use SWATH mass spectrometry to quantify protein abundance and protein turnover in fibroblasts from a monozygotic twin pair discordant for T21, and to profile protein expression in 11 unrelated DS individuals and matched controls. The integration of the steady-state and turnover proteomic data indicates that protein-specific degradation of members of stoichiometric complexes is a major determinant of T21 gene dosage outcome, both within and between individuals. This effect is not apparent from genomic and transcriptomic data. The data also reveal that T21 results in extensive proteome remodeling, affecting proteins encoded by all chromosomes. Finally, we find broad, organelle-specific post-transcriptional effects such as significant downregulation of the mitochondrial proteome contributing to T21 hallmarks. Overall, we provide a valuable proteomic resource to understand the origin of DS phenotypic manifestations.

Assuntos

Fibroblastos/metabolismo , Fibroblastos/patologia , Proteoma/metabolismo , Proteostase , Trissomia/patologia , Bases de Dados de Proteínas , Mecanismo Genético de Compensação de Dose , Regulação da Expressão Gênica , Humanos , Proteínas Mitocondriais/genética , Proteínas Mitocondriais/metabolismo , Organelas/metabolismo , Proteólise , Proteostase/genética , Transdução de Sinais , Trissomia/genética

13.

Tandem mass spectral libraries of peptides and their roles in proteomics research.

Shao, Wenguang; Lam, Henry.

Mass Spectrom Rev ; 36(5): 634-648, 2017 09.

Artigo em Inglês | MEDLINE | ID: mdl-27403644

RESUMO

Proteomics is a rapidly maturing field aimed at the high-throughput identification and quantification of all proteins in a biological system. The cornerstone of proteomic technology is tandem mass spectrometry of peptides resulting from the digestion of protein mixtures. The fragmentation pattern of each peptide ion is captured in its tandem mass spectrum, which enables its identification and acts as a fingerprint for the peptide. Spectral libraries are simply searchable collections of these fingerprints, which have taken on an increasingly prominent role in proteomic data analysis. This review describes the historical development of spectral libraries in proteomics, details the computational procedures behind library building and searching, surveys the current applications of spectral libraries, and discusses the outstanding challenges. © 2016 Wiley Periodicals, Inc. Mass Spec Rev 36:634-648, 2017.

Assuntos

Bases de Dados de Proteínas , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Algoritmos , Animais , Mineração de Dados , História do Século XX , História do Século XXI , Peptídeos/análise , Peptídeos/química , Peptídeos/metabolismo , Filogenia , Processamento de Proteína Pós-Traducional , Software , Espectrometria de Massas em Tandem/história , Espectrometria de Massas em Tandem/estatística & dados numéricos , Interface Usuário-Computador

14.

A peptide identification-free, genome sequence-independent shotgun proteomics workflow for strain-level bacterial differentiation.

Shao, Wenguang; Zhang, Min; Lam, Henry; Lau, Stanley C K.

Sci Rep ; 5: 14337, 2015 Sep 23.

Artigo em Inglês | MEDLINE | ID: mdl-26395646

RESUMO

Shotgun proteomics is an emerging tool for bacterial identification and differentiation. However, the identification of the mass spectra of peptides to genome-derived peptide sequences remains a key issue that limits the use of shotgun proteomics to bacteria with genome sequences available. In this proof-of-concept study, we report a novel bacterial fingerprinting method that enjoys the resolving power and accuracy of mass spectrometry without the burden of peptide identification (i.e. genome sequence-independent). This method uses a similarity-clustering algorithm to search for mass spectra that are derived from the same peptide and merge them into a unique consensus spectrum as the basis to generate proteomic fingerprints of bacterial isolates. In comparison to a traditional peptide identification-based shotgun proteomics workflow and a PCR-based DNA fingerprinting method targeting the repetitive extragenic palindromes elements in bacterial genomes, the novel method generated fingerprints that were richer in information and more discriminative in differentiating E. coli isolates by their animal sources. The novel method is readily deployable to any cultivable bacteria, and may be used for several fields of study such as environmental microbiology, applied microbiology, and clinical microbiology.

Assuntos

Impressões Digitais de DNA/métodos , DNA Bacteriano/genética , Escherichia coli/classificação , Escherichia coli/genética , Tipagem Molecular/métodos , Algoritmos , Animais , Sequência de Bases , Cromatografia Líquida/métodos , Escherichia coli/isolamento & purificação , Fezes/microbiologia , Reação em Cadeia da Polimerase/métodos , Proteômica/métodos , Análise de Sequência de DNA , Espectrometria de Massas em Tandem/métodos

15.

Tracking the sources of blood meals of parasitic arthropods using shotgun proteomics and unidentified tandem mass spectral libraries.

Önder, Özlem; Shao, Wenguang; Lam, Henry; Brisson, Dustin.

Nat Protoc ; 9(4): 842-50, 2014 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-24625782

RESUMO

Identifying the species on which hematophagous arthropods feed is crucial for studying the factors that affect pathogen distributions and that can aid public health. Here we describe a protocol to identify the species a parasitic arthropod has previously fed upon by identifying the source of the remnants of a previous blood meal via shotgun proteomics and spectral matching. The protocol is a nontargeted approach that uses the entire detected blood proteome for source identification; it does not require a priori knowledge of genome or protein sequences. Instead, reference spectral libraries are compiled from the blood of multiple host species by using SpectraST, which takes â¼4 d; the identification of the species from which a previous blood meal of a hematophagous arthropod was taken is achieved with spectral matching against the reference spectral libraries, which takes approximately another 4 d. This method is robust against random degradation of the blood meal and can identify unknown blood remnants months after the feeding event.

Assuntos

Artrópodes , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Vertebrados/parasitologia , Animais , Sangue , Ixodes , Vertebrados/sangue

16.

Refining similarity scoring to enable decoy-free validation in spectral library searching.

Shao, Wenguang; Zhu, Kan; Lam, Henry.

Proteomics ; 13(22): 3273-83, 2013 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-24115759

RESUMO

Spectral library searching is a maturing approach for peptide identification from MS/MS, offering an alternative to traditional sequence database searching. Spectral library searching relies on direct spectrum-to-spectrum matching between the query data and the spectral library, which affords better discrimination of true and false matches, leading to improved sensitivity. However, due to the inherent diversity of the peak location and intensity profiles of real spectra, the resulting similarity score distributions often take on unpredictable shapes. This makes it difficult to model the scores of the false matches accurately, necessitating the use of decoy searching to sample the score distribution of the false matches. Here, we refined the similarity scoring in spectral library searching to enable the validation of spectral search results without the use of decoys. We rank-transformed the peak intensities to standardize all spectra, making it possible to fit a parametric distribution to the scores of the nontop-scoring spectral matches. The statistical significance of the top-scoring match can then be estimated in a rigorous manner according to Extreme Value Theory. The overall result is a more robust and interpretable measure of the quality of the spectral match, which can be obtained without decoys. We tested this refined similarity scoring function on real datasets and demonstrated its effectiveness. This approach reduces search time, increases sensitivity, and extends spectral library searching to situations where decoy spectra cannot be readily generated, such as in searching unidentified and nonpeptide spectral libraries.

Assuntos

Biologia Computacional/métodos , Bases de Dados de Proteínas , Modelos Estatísticos , Peptídeos , Espectrometria de Massas em Tandem/métodos , Linhagem Celular Tumoral , Humanos , Fragmentos de Peptídeos , Peptídeos/análise , Peptídeos/química , Peptídeos/classificação , Reprodutibilidade dos Testes , Proteínas de Saccharomyces cerevisiae , Sensibilidade e Especificidade

17.

Denoising peptide tandem mass spectra for spectral libraries: a Bayesian approach.

Shao, Wenguang; Lam, Henry.

J Proteome Res ; 12(7): 3223-32, 2013 Jul 05.

Artigo em Inglês | MEDLINE | ID: mdl-23675732

RESUMO

With the rapid accumulation of data from shotgun proteomics experiments, it has become feasible to build comprehensive and high-quality spectral libraries of tandem mass spectra of peptides. A spectral library condenses experimental data into a retrievable format and can be used to aid peptide identification by spectral library searching. A key step in spectral library building is spectrum denoising, which is best accomplished by merging multiple replicates of the same peptide ion into a consensus spectrum. However, this approach cannot be applied to "singleton spectra," for which only one observed spectrum is available for the peptide ion. We developed a method, based on a Bayesian classifier, for denoising peptide tandem mass spectra. The classifier accounts for relationships between peaks, and can be trained on the fly from consensus spectra and immediately applied to denoise singleton spectra, without hard-coded knowledge about peptide fragmentation. A linear regression model was also trained to predict the number of useful "signal" peaks in a spectrum, thereby obviating the need for arbitrary thresholds for peak filtering. This Bayesian approach accumulates weak evidence systematically to boost the discrimination power between signal and noise peaks, and produces readily interpretable conditional probabilities that offer valuable insights into peptide fragmentation behaviors. By cross validation, spectra denoised by this method were shown to retain more signal peaks, and have higher spectral similarities to replicates, than those filtered by intensity only.

Assuntos

Teorema de Bayes , Peptídeos/isolamento & purificação , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Algoritmos , Bases de Dados de Proteínas , Fragmentos de Peptídeos/química , Fragmentos de Peptídeos/isolamento & purificação , Peptídeos/química , Razão Sinal-Ruído , Software

18.

Identifying sources of tick blood meals using unidentified tandem mass spectral libraries.

Önder, Özlem; Shao, Wenguang; Kemps, Brian D; Lam, Henry; Brisson, Dustin.

Nat Commun ; 4: 1746, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-23612287

RESUMO

Rapid and reliable identification of the vertebrate species on which a disease vector previously parasitized is imperative to study ecological factors that affect pathogen distribution and can aid the development of public health programs. Here we describe a proteome profiling technique designed to identify the source of blood meals of haematophagous arthropods. This method employs direct spectral matching and thus does not require a priori knowledge of any genetic or protein sequence information. Using this technology, we detect remnants of blood in blacklegged ticks (Ixodes scapularis) and correctly determine the vertebrate species from which the blood was derived, even 6 months after the tick had fed. This biological fingerprinting methodology is sensitive, fast, cost-effective and can potentially be adapted for other biological and medical applications when existing genome-based methods are impractical or ineffective.

Assuntos

Comportamento Alimentar/fisiologia , Biblioteca de Peptídeos , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Carrapatos/fisiologia , Algoritmos , Animais , Análise por Conglomerados , Sequência Conservada , Evolução Molecular , Larva/metabolismo , Camundongos , Muda , Proteoma/metabolismo , Especificidade da Espécie , Vertebrados/parasitologia

19.

Understanding the improved sensitivity of spectral library searching over sequence database searching in proteomics data analysis.

Zhang, Xin; Li, Yunzi; Shao, Wenguang; Lam, Henry.

Proteomics ; 11(6): 1075-85, 2011 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-21298786

RESUMO

Spectral library searching has been recently proposed as an alternative to sequence database searching for peptide identification from MS/MS. We performed a systematic comparison between spectral library searching and sequence database searching using a wide variety of data to better demonstrate, and understand, the superior sensitivity of the former observed in preliminary studies. By decoupling the effect of search space, we demonstrated that the success of spectral library searching is primarily attributable to the use of real library spectra for matching, without which the sensitivity advantage largely disappears. We further determined the extent to which the use of real peak intensities and non-canonical fragments, both under-utilized information in sequence database searching, contributes to the sensitivity advantage. Lastly, we showed that spectral library searching is disproportionately more successful in identifying low-quality spectra, and complex spectra of higher- charged precursors, both important frontiers in peptide sequencing. Our results answered important outstanding questions about this promising yet unproven method using well-controlled computational experiments and sound statistical approaches.

Assuntos

Bases de Dados de Proteínas/estatística & dados numéricos , Biblioteca de Peptídeos , Proteômica/estatística & dados numéricos , Biologia Computacional , Interpretação Estatística de Dados , Humanos , Ferramenta de Busca , Espectrometria de Massas em Tandem/estatística & dados numéricos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA