Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 82
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Proteomics ; 24(1-2): e2200332, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37876146

RESUMO

This article summarizes the PROTREC method and investigates the impact that the different hyper-parameters have on the task of missing protein prediction using PROTREC. We evaluate missing protein recovery rates using different PROTREC score selection approaches (MAX, MIN, MEDIAN, and MEAN), different PROTREC score thresholds, as well as different complex size thresholds. In addition, we included two additional cancer datasets in our analysis and introduced a new validation method to check both the robustness of the PROTREC method as well as the correctness of our analysis. Our analysis showed that the missing protein recovery rate can be improved by adopting PROTREC score selection operations of MIN, MEDIAN, and MEAN instead of the default MAX. However, this may come at a cost of reduced numbers of proteins predicted and validated. The users should therefore choose their hyper-parameters carefully to find a balance in the accuracy-quantity trade-off. We also explored the possibility of combining PROTREC with a p-value-based method (FCS) and demonstrated that PROTREC is able to perform well independently without any help from a p-value-based method. Furthermore, we conducted a downstream enrichment analysis to understand the biological pathways and protein networks within the cancerous tissues using the recovered proteins. Missing protein recovery rate using PROTREC can be improved by selecting a different PROTREC score selection method. Different PROTREC score selection methods and other hyper-parameters such as PROTREC score threshold and complex size threshold introduce accuracy-quantity trade-off. PROTREC is able to perform well independently of any filtering using a p-value-based method. Verification of the PROTREC method on additional cancer datasets. Downstream Enrichment Analysis to understand the biological pathways and protein networks in cancerous tissues.


Assuntos
Algoritmos , Neoplasias , Humanos
2.
J Proteome Res ; 23(7): 2323-2331, 2024 Jul 05.
Artigo em Inglês | MEDLINE | ID: mdl-38865581

RESUMO

The Chromosome-Centric Human Proteome Project (C-HPP) aims to identify all proteins encoded by the human genome. Currently, the human proteome still contains approximately 2000 PE2-PE5 proteins, referring to annotated coding genes that lack sufficient protein-level evidence. During the past 10 years, it has been increasingly difficult to identify PE2-PE5 proteins in C-HPP approaches due to the limited occurrence. Therefore, we proposed that reanalyzing massive MS data sets in repository with newly developed algorithms may increase the occurrence of the peptides of these proteins. In this study, we downloaded 1000 MS data sets via the ProteomeXchange database. Using pFind software, we identified peptides referring to 1788 PE2-PE5 proteins. Among them, 11 PE2 and 16 PE5 proteins were identified with at least 2 peptides, and 12 of them were identified using 2 peptides in a single data set, following the criteria of the HPP guidelines. We found translation evidence for 16 of the 11 PE2 and 16 PE5 proteins in our RNC-seq data, supporting their existence. The properties of the PE2 and PE5 proteins were similar to those of the PE1 proteins. Our approach demonstrated that mining PE2 and PE5 proteins in massive data repository is still worthy, and multidata set peptide identifications may support the presence of PE2 and PE5 proteins or at least prompt additional studies for validation. Extremely high throughput could be a solution to finding more PE2 and PE5 proteins.


Assuntos
Bases de Dados de Proteínas , Proteoma , Software , Humanos , Proteoma/análise , Proteoma/genética , Algoritmos , Espectrometria de Massas/métodos , Proteômica/métodos , Peptídeos/genética , Peptídeos/análise , Peptídeos/química , Genoma Humano
3.
J Proteome Res ; 23(1): 238-248, 2024 01 05.
Artigo em Inglês | MEDLINE | ID: mdl-38085962

RESUMO

Efforts to understand the complexities of human biology encompass multidimensional aspects, with proteins emerging as crucial components. However, studying the human ovary introduces unique challenges due to its complex dynamics and changes over a lifetime, varied cellular composition, and limited sample access. Here, four new RNA-seq samples of ovarian cortex spanning ages of 7 to 32 were sequenced and added to the existing data in the Human Protein Atlas (HPA) database www.proteinatlas.org, opening the doors to unique possibilities for exploration of oocyte-specific proteins. Based on transcriptomics analysis of the four new tissue samples representing both prepubertal girls and women of fertile age, we selected 20 protein candidates that lacked previous evidence at the protein level, so-called "missing proteins" (MPs). The proteins were validated using high-resolution antibody-based profiling and single-cell transcriptomics. Fourteen proteins exhibited consistent single-cell expression patterns in oocytes and granulosa cells, confirming their presence in the ovary and suggesting that these proteins play important roles in ovarian function, thus proposing that these 14 proteins should no longer be classified as MPs. This research significantly advances the understanding of MPs, unearthing fresh avenues for prospective exploration. By integrating innovative methodologies and leveraging the wealth of data in the HPA database, these insights contribute to refining our understanding of protein roles within the human ovary and opening the doors for further investigations into missing proteins and human reproduction.


Assuntos
Ovário , Proteômica , Humanos , Feminino , Estudos Prospectivos , Oócitos , Proteínas/metabolismo , Perfilação da Expressão Gênica
4.
J Proteome Res ; 23(2): 532-549, 2024 02 02.
Artigo em Inglês | MEDLINE | ID: mdl-38232391

RESUMO

Since 2010, the Human Proteome Project (HPP), the flagship initiative of the Human Proteome Organization (HUPO), has pursued two goals: (1) to credibly identify the protein parts list and (2) to make proteomics an integral part of multiomics studies of human health and disease. The HPP relies on international collaboration, data sharing, standardized reanalysis of MS data sets by PeptideAtlas and MassIVE-KB using HPP Guidelines for quality assurance, integration and curation of MS and non-MS protein data by neXtProt, plus extensive use of antibody profiling carried out by the Human Protein Atlas. According to the neXtProt release 2023-04-18, protein expression has now been credibly detected (PE1) for 18,397 of the 19,778 neXtProt predicted proteins coded in the human genome (93%). Of these PE1 proteins, 17,453 were detected with mass spectrometry (MS) in accordance with HPP Guidelines and 944 by a variety of non-MS methods. The number of neXtProt PE2, PE3, and PE4 missing proteins now stands at 1381. Achieving the unambiguous identification of 93% of predicted proteins encoded from across all chromosomes represents remarkable experimental progress on the Human Proteome parts list. Meanwhile, there are several categories of predicted proteins that have proved resistant to detection regardless of protein-based methods used. Additionally there are some PE1-4 proteins that probably should be reclassified to PE5, specifically 21 LINC entries and ∼30 HERV entries; these are being addressed in the present year. Applying proteomics in a wide array of biological and clinical studies ensures integration with other omics platforms as reported by the Biology and Disease-driven HPP teams and the antibody and pathology resource pillars. Current progress has positioned the HPP to transition to its Grand Challenge Project focused on determining the primary function(s) of every protein itself and in networks and pathways within the context of human health and disease.


Assuntos
Anticorpos , Proteoma , Humanos , Proteoma/genética , Proteoma/análise , Bases de Dados de Proteínas , Espectrometria de Massas/métodos , Proteômica/métodos
5.
J Proteome Res ; 22(4): 1148-1158, 2023 04 07.
Artigo em Inglês | MEDLINE | ID: mdl-36445260

RESUMO

The Chromosome-centric Human Proteome Project (C-HPP) aims at identifying the proteins as gene products encoded by the human genome, characterizing their isoforms and functions. The existence of products has now been confirmed for 93.2% of the genes at the protein level. The remaining mostly correspond to proteins of low abundance or difficult to access. Over the past years, we have significantly contributed to the identification of missing proteins in the human spermatozoa. We pursue our search in the reproductive sphere with a focus on early human embryonic development. Pluripotent cells, developing into the fetus, and trophoblast cells, giving rise to the placenta, emerge during the first weeks. This emergence is a focus of scientists working in the field of reproduction, placentation and regenerative medicine. Most knowledge has been harnessed by transcriptomic analysis. Interestingly, some genes are uniquely expressed in those cells, giving the opportunity to uncover new proteins that might play a crucial role in setting up the molecular events underlying early embryonic development. Here, we analyzed naive pluripotent and trophoblastic stem cells and discovered 4 new missing proteins, thus contributing to the C-HPP. The mass spectrometry proteomics data was deposited on ProteomeXchange under the data set identifier PXD035768.


Assuntos
Proteoma , Trofoblastos , Masculino , Humanos , Proteoma/genética , Proteoma/análise , Espectrometria de Massas , Cromossomos/química , Linhagem Celular
6.
J Proteome Res ; 22(4): 1071-1079, 2023 04 07.
Artigo em Inglês | MEDLINE | ID: mdl-36108145

RESUMO

In the quest for "missing proteins" (MPs), the proteins encoded by the human genome still lacking evidence of existence at the protein level, novel approaches are needed to detect this challenging group of proteins. The current count stands at 1,343 MPs, and it is likely that many of these proteins are expressed at low levels, in rare cell or tissue types, or the cells in which they are expressed may only represent a small minority of the tissue. Here, we used an integrated omics approach to identify and explore MPs in human ovaries. By taking advantage of publicly available transcriptomics and antibody-based proteomics data in the Human Protein Atlas (HPA), we selected 18 candidates for further immunohistochemical analysis using an exclusive collection of ovarian tissues from women and patients of reproductive age. The results were compared with data from single-cell mRNA sequencing, and seven proteins (CTXN1, MRO, RERGL, TTLL3, TRIM61, TRIM73, and ZNF793) could be validated at the single-cell type level with both methods. We present for the first time the cell type-specific spatial localization of 18 MPs in human ovarian follicles, thereby showcasing the utility of the HPA database as an important resource for identification of MPs suitable for exploration in specialized tissue samples. The results constitute a starting point for further quantitative and qualitative analysis of the human ovaries, and the novel data for the seven proteins that were validated with both methods should be considered as evidence of existence of these proteins in human ovary.


Assuntos
Ovário , Proteômica , Humanos , Feminino , Ovário/química , Proteômica/métodos , Proteínas/metabolismo , Anticorpos/metabolismo , Perfilação da Expressão Gênica , Proteoma/genética , Proteoma/análise
7.
J Proteome Res ; 22(4): 1024-1042, 2023 04 07.
Artigo em Inglês | MEDLINE | ID: mdl-36318223

RESUMO

The 2022 Metrics of the Human Proteome from the HUPO Human Proteome Project (HPP) show that protein expression has now been credibly detected (neXtProt PE1 level) for 18 407 (93.2%) of the 19 750 predicted proteins coded in the human genome, a net gain of 50 since 2021 from data sets generated around the world and reanalyzed by the HPP. Conversely, the number of neXtProt PE2, PE3, and PE4 missing proteins has been reduced by 78 from 1421 to 1343. This represents continuing experimental progress on the human proteome parts list across all the chromosomes, as well as significant reclassifications. Meanwhile, applying proteomics in a vast array of biological and clinical studies continues to yield significant findings and growing integration with other omics platforms. We present highlights from the Chromosome-Centric HPP, Biology and Disease-driven HPP, and HPP Resource Pillars, compare features of mass spectrometry and Olink and Somalogic platforms, note the emergence of translation products from ribosome profiling of small open reading frames, and discuss the launch of the initial HPP Grand Challenge Project, "A Function for Each Protein".


Assuntos
Proteoma , Proteômica , Humanos , Proteoma/genética , Proteoma/análise , Bases de Dados de Proteínas , Espectrometria de Massas/métodos , Fases de Leitura Aberta , Proteômica/métodos
8.
Mol Cell Proteomics ; 20: 100062, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33640492

RESUMO

We celebrate the 10th anniversary of the launch of the HUPO Human Proteome Project (HPP) and its major milestone of confident detection of at least one protein from each of 90% of the predicted protein-coding genes, based on the output of the entire proteomics community. The Human Genome Project reached a similar decadal milestone 20 years ago. The HPP has engaged proteomics teams around the world, strongly influenced data-sharing, enhanced quality assurance, and issued stringent guidelines for claims of detecting previously "missing proteins." This invited perspective complements papers on "A High-Stringency Blueprint of the Human Proteome" and "The Human Proteome Reaches a Major Milestone" in special issues of Nature Communications and Journal of Proteome Research, respectively, released in conjunction with the October 2020 virtual HUPO Congress and its celebration of the 10th anniversary of the HUPO HPP.


Assuntos
Proteoma , Sociedades Científicas/história , Confiabilidade dos Dados , História do Século XXI , Humanos , Disseminação de Informação
9.
J Proteome Res ; 20(12): 5329-5339, 2021 12 03.
Artigo em Inglês | MEDLINE | ID: mdl-34748338

RESUMO

With the steadfast development of proteomic technology, the number of missing proteins (MPs) has been continuously shrinking, with approximately 1470 MPs that have not been explored yet. Due to this phenomenon, the discovery of MPs has been increasingly more difficult and elusive. In order to face this challenge, we have hypothesized that a stable aneuploid cell line with increased chromosomes serves as a useful material for assisting MP exploration. Ker-CT cell line with trisomy at chromosome 5 and 20 was selected for this purpose. With a combination strategy of RNA-Seq and LC-MS/MS, a total of 22 178 transcripts and 8846 proteins were identified in Ker-CT. Although the transcripts corresponding to 15 and 15 MP genes located at chromosome 5 and 20 were detected, none of the MPs were found in Ker-CT. Surprisingly, 3 MPs containing at least two unique non-nest peptides of length ≥9 amino acids were identified in Ker-CT, whose genes are located on chromosome 3 and 10, respectively. Furthermore, the 3 MPs were verified using the method of parallel reaction monitoring (PRM). These results suggest that the abnormal status of chromosomes may not only impact the expression of the corresponding genes in trisomy chromosomes, but also influence that of other chromosomes, which benefits MP discovery. The data obtained in this study are available via ProteomeXchange (PXD028647) and PeptideAtlas (PASS01700), respectively.


Assuntos
Proteogenômica , Proteômica , Aneuploidia , Linhagem Celular , Cromatografia Líquida , Humanos , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos
10.
J Proteome Res ; 20(12): 5227-5240, 2021 12 03.
Artigo em Inglês | MEDLINE | ID: mdl-34670092

RESUMO

The 2021 Metrics of the HUPO Human Proteome Project (HPP) show that protein expression has now been credibly detected (neXtProt PE1 level) for 18 357 (92.8%) of the 19 778 predicted proteins coded in the human genome, a gain of 483 since 2020 from reports throughout the world reanalyzed by the HPP. Conversely, the number of neXtProt PE2, PE3, and PE4 missing proteins has been reduced by 478 to 1421. This represents remarkable progress on the proteome parts list. The utilization of proteomics in a broad array of biological and clinical studies likewise continues to expand with many important findings and effective integration with other omics platforms. We present highlights from the Immunopeptidomics, Glycoproteomics, Infectious Disease, Cardiovascular, Musculo-Skeletal, Liver, and Cancers B/D-HPP teams and from the Knowledgebase, Mass Spectrometry, Antibody Profiling, and Pathology resource pillars, as well as ethical considerations important to the clinical utilization of proteomics and protein biomarkers.


Assuntos
Benchmarking , Proteoma , Bases de Dados de Proteínas , Humanos , Espectrometria de Massas/métodos , Proteoma/análise , Proteoma/genética , Proteômica/métodos
11.
J Proteome Res ; 19(12): 4808-4814, 2020 12 04.
Artigo em Inglês | MEDLINE | ID: mdl-33172275

RESUMO

The Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2012 to perfect the annotation of human protein existence by identifying stronger evidence of the expression of missing proteins (MPs) at the protein level. After an 8 year effort all over the world, the number of MPs in the neXtProt database significantly decreased from 5511 (2012-02-24) to 1899 (2020-01-17). It is now more difficult to provide confident evidence of the remaining MPs because of their specific characteristics, including low abundance, low molecular weight, unexpected modifications, transmembrane structure, tissue-expression specificity, and so on. A higher resolution mass spectrometry (MS) interpretation engine might provide an opportunity to identify these buried MPs in complex samples by the combination with multi-tissue large-scale proteomics. In this study, open-pFind was used to dig MPs from 20 pairs of healthy human tissues by Wang et al. ( Mol. Syst. Biol. 2019, 15 (2), e8503) combined with our large-scale testis data set digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity for different amino acid residues ( J. Proteme Res. 2019, 18 (12), 4189-4196). A total of 1 535 536 peptides with 17 283 477 peptide-spectrum matches (PSMs) were mapped to 14 279 protein entries at a false discovery rate of <1% at the PSM, peptide, and protein levels. A total of 103 MP candidates were identified, among which 86 candidates had more unique peptide numbers compared with our single testis tissue. After rigorous screening, manual checks, peptide synthesis, and matching with documented peptides from PeptideAtlas, we validated four MPs, P0C7T8 (duodenum and small intestine), Q8WWZ4 (stomach and rectum), Q8IV35 (fallopian tube), and O14921 (tonsil), at the protein level. All MS raw files have been deposited to the ProteomeXchange with identifier PXD021391.


Assuntos
Proteoma , Proteômica , Feminino , Humanos , Masculino , Espectrometria de Massas , Peso Molecular , Peptídeos
12.
J Proteome Res ; 19(12): 4857-4866, 2020 12 04.
Artigo em Inglês | MEDLINE | ID: mdl-33210925

RESUMO

Since the Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2010, many techniques have been adopted for the discovery of missing proteins (MPs). Because of these efforts, only 1481 MPs remained as of July 2020; however, by relying only on technique optimization, researchers have reached a bottleneck in MP discovery. Protein expression is tissue- or cell-type-dependent. The tissues of the human testis and brain have been reported to harbor a large number of tissue-specific genes and proteins; however, few studies have been performed on human brain tissue or cells to identify MPs. Herein a metastatic cell line derived from brain cancer, D283 Med, was used to search for MPs. With a traditional and simple shotgun workflow to separate the peptides into 20 fractions, 12 MPs containing at least two unique non-nested peptides (amino acid length ≥9) were identified in this cell line with a protein false discovery rate of <1%. Following the same experimental protocol, only one MP was found in a nonmetastatic brain cancer cell line, U-118 MG. Furthermore, 12 MPs were verified as having two non-nested unique peptides by matching them with corresponding chemically synthesized peptides through parallel reaction monitoring. These results clearly demonstrate that the appropriate selection of experimental materials, either tissues or cell lines, is imperative for MP discovery. The data obtained in this study are available via ProteomeXchange (PXD021482) and PeptideAtlas (PASS01627).


Assuntos
Neoplasias Cerebelares , Meduloblastoma , Linhagem Celular , Humanos , Masculino , Meduloblastoma/genética , Peptídeos , Proteômica
13.
J Proteome Res ; 19(12): 4766-4781, 2020 12 04.
Artigo em Inglês | MEDLINE | ID: mdl-33170010

RESUMO

The localization of proteins at a tissue- or cell-type-specific level is tightly linked to the protein function. To better understand each protein's role in cellular systems, spatial information constitutes an important complement to quantitative data. The standard methods for determining the spatial distribution of proteins in single cells of complex tissue samples make use of antibodies. For a stringent analysis of the human proteome, we used orthogonal methods and independent antibodies to validate 5981 antibodies that show the expression of 3775 human proteins across all major human tissues. This enhanced validation uncovered 56 proteins corresponding to the group of "missing proteins" and 171 proteins of unknown function. The presented strategy will facilitate further discussions around criteria for evidence of protein existence based on immunohistochemistry and serves as a useful guide to identify candidate proteins for integrative studies with quantitative proteomics methods.


Assuntos
Proteoma , Proteômica , Anticorpos , Humanos , Imuno-Histoquímica
14.
J Proteome Res ; 19(1): 401-408, 2020 01 03.
Artigo em Inglês | MEDLINE | ID: mdl-31773964

RESUMO

The mission of the Chromosome-Centric Human Proteome Project (C-HPP) to discover missing proteins (MPs) has become increasingly difficult due to the remaining low-abundance, high-hydrophobicity, or low-molecular-weight MPs. We have reported two approaches to resolve these identification problems for the low-abundance and high-hydrophobicity MPs, respectively. In this study, to improve the identification of low-abundance MPs with high hydrophobicity, we combined two approaches and obtained MPs from several different cancer cell lines. Their membrane fractions were isolated by ultracentrifugation, and the low-abundance proteins were enriched at the protein level with the ProteoMiner kit. After that, the peptides from the enriched proteins were separated by high concentrations of organic solvents according to their hydrophobicity as the first dimension of separation at the peptide level, and the second and third dimensions of separation involved a high pH reversed-phase and an acid reversed-phase column, respectively. In total, 16 MPs (at least two non-nested unique peptides with ≥9 amino acids) with 61 unique peptides were identified from four human cancer cell lines, including 2, 8, 2, and 7 MPs from HeLa, HCT116, SNU-1, and HepG2 cells, respectively. Furthermore, all MPs were verified with two non-nested unique peptides through parallel reaction monitoring (PRM) by matching the peptides with their chemically synthesized peptides. Interestingly, two additional MPs were verified from the same cell line by PRM assay, although the two non-nested unique peptides with ≥9 amino acids for each MP were identified from different MS injections or cell lines by data-dependent acquisition (DDA). Thus, a total of 18 MPs were dug out in this study. The data are available via ProteomeXchange (PXD014058) and PeptideAtlas (PASS01388).


Assuntos
Proteínas/análise , Proteínas/química , Proteômica/métodos , Linhagem Celular Tumoral , Eletroforese em Gel de Poliacrilamida , Humanos , Interações Hidrofóbicas e Hidrofílicas , Espectrometria de Massas/métodos
15.
J Proteome Res ; 19(12): 4747-4753, 2020 12 04.
Artigo em Inglês | MEDLINE | ID: mdl-33124832

RESUMO

The Chromosome-Centric Human Proteome Project (C-HPP) aims at the identification of missing proteins (MPs) and the functional characterization of functionally unannotated PE1 (uPE1) proteins. A major challenge in addressing this goal is that many human proteins and MPs are silent in adult cells. A promising approach to overcome such challenge is to exploit the advantage of novel tools such as pluripotent stem cells (PSCs), which are capable of differentiation into three embryonic germ layers, namely, the endoderm, mesoderm, and ectoderm. Here we present several examples of how the Human Y Chromosome Proteome Project (Y-HPP) benefited from this approach to meet C-HPP goals. Furthermore, we discuss how integrating CRISPR engineering, human-induced pluripotent stem cell (hiPSC)-derived disease modeling systems, and organoid technologies provides a unique platform for Y-HPP and C-HPP for MP identification and the functional characterization of human proteins, especially uPE1s.


Assuntos
Células-Tronco Pluripotentes , Proteoma , Diferenciação Celular , Cromossomos Humanos Y , Humanos , Proteoma/genética
16.
J Proteome Res ; 19(12): 4735-4746, 2020 12 04.
Artigo em Inglês | MEDLINE | ID: mdl-32931287

RESUMO

According to the 2020 Metrics of the HUPO Human Proteome Project (HPP), expression has now been detected at the protein level for >90% of the 19 773 predicted proteins coded in the human genome. The HPP annually reports on progress made throughout the world toward credibly identifying and characterizing the complete human protein parts list and promoting proteomics as an integral part of multiomics studies in medicine and the life sciences. NeXtProt release 2020-01 classified 17 874 proteins as PE1, having strong protein-level evidence, up 180 from 17 694 one year earlier. These represent 90.4% of the 19 773 predicted coding genes (all PE1,2,3,4 proteins in neXtProt). Conversely, the number of neXtProt PE2,3,4 proteins, termed the "missing proteins" (MPs), was reduced by 230 from 2129 to 1899 since the neXtProt 2019-01 release. PeptideAtlas is the primary source of uniform reanalysis of raw mass spectrometry data for neXtProt, supplemented this year with extensive data from MassIVE. PeptideAtlas 2020-01 added 362 canonical proteins between 2019 and 2020 and MassIVE contributed 84 more, many of which converted PE1 entries based on non-MS evidence to the MS-based subgroup. The 19 Biology and Disease-driven B/D-HPP teams continue to pursue the identification of driver proteins that underlie disease states, the characterization of regulatory mechanisms controlling the functions of these proteins, their proteoforms, and their interactions, and the progression of transitions from correlation to coexpression to causal networks after system perturbations. And the Human Protein Atlas published Blood, Brain, and Metabolic Atlases.


Assuntos
Proteoma , Proteômica , Bases de Dados de Proteínas , Genoma Humano , Humanos , Espectrometria de Massas , Proteoma/genética
17.
Cell Biol Toxicol ; 36(3): 261-272, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-31599373

RESUMO

In the advanced stages, malignant melanoma (MM) has a very poor prognosis. Due to tremendous efforts in cancer research over the last 10 years, and the introduction of novel therapies such as targeted therapies and immunomodulators, the rather dark horizon of the median survival has dramatically changed from under 1 year to several years. With the advent of proteomics, deep-mining studies can reach low-abundant expression levels. The complexity of the proteome, however, still surpasses the dynamic range capabilities of current analytical techniques. Consequently, many predicted protein products with potential biological functions have not yet been verified in experimental proteomic data. This category of 'missing proteins' (MP) is comprised of all proteins that have been predicted but are currently unverified. As part of the initiative launched in 2016 in the USA, the European Cancer Moonshot Center has performed numerous deep proteomics analyses on samples from MM patients. In this study, nine MPs were clearly identified by mass spectrometry in MM metastases. Some MPs significantly correlated with proteins that possess identical PFAM structural domains; and other MPs were significantly associated with cancer-related proteins. This is the first study to our knowledge, where unknown and novel proteins have been annotated in metastatic melanoma tumour tissue.


Assuntos
Melanoma/genética , Metástase Neoplásica/genética , Proteômica/métodos , Adulto , Biomarcadores Tumorais/genética , Feminino , Genoma Humano/genética , Humanos , Masculino , Pessoa de Meia-Idade , Anotação de Sequência Molecular/métodos , Anotação de Sequência Molecular/tendências , Prognóstico , Proteoma/genética , Proteoma/metabolismo , Neoplasias Cutâneas/genética , Melanoma Maligno Cutâneo
18.
J Proteome Res ; 18(12): 4197-4205, 2019 12 06.
Artigo em Inglês | MEDLINE | ID: mdl-31646870

RESUMO

The Human Proteome Project (HPP) has made great efforts to clarify the existing evidence of human proteins since 2012. However, according to the recent release of neXtProt (2019-1), approximately 10% of all human genes still have inadequate or no experimental evidence of their translation at the protein level. They were categorized as missing proteins (PE2-PE4). To further the goal of HPP, we developed a two-step bioinformatic strategy addressing the utilization of the SRMAtlas synthetic peptides corresponding to the missing proteins as an exclusive reference in order to explore their natural counterparts within GPM. In the first step, we searched the GPM for the non-nested SRMAtlas peptides corresponding to the missing proteins, taking under consideration only those detected via ≥2 non-nested unitypic/proteotypic peptides "Stranded peptides" with length ≥9 amino acids in the same proteomic study. As a result, 51 missing proteins were newly detected in 35 different proteomic studies. In the second step, we validated these newly detected missing proteins based on matching the spectra of their synthetic and natural peptides in SRMAtlas and GPM, respectively. The results showed that 23 of the missing proteins with ≥2 non-nested peptides were validated by careful spectral matching.


Assuntos
Bases de Dados de Proteínas , Proteoma , Proteômica/métodos , Humanos , Peptídeos/síntese química , Mapas de Interação de Proteínas , Reprodutibilidade dos Testes , Software
19.
J Proteome Res ; 18(12): 4189-4196, 2019 12 06.
Artigo em Inglês | MEDLINE | ID: mdl-31657219

RESUMO

In recent years, high-throughput technologies have contributed to the development of a more precise picture of the human proteome. However, 2129 proteins remain listed as missing proteins (MPs) in the newest neXtProt release (2019-02). The main reasons for MPs are a low abundance, a low molecular weight, unexpected modifications, membrane characteristics, and so on. Moreover, >50% of the MS/MS data have not been successfully identified in shotgun proteomics. Open-pFind, an efficient open search engine, recently released by the pFind group in China, might provide an opportunity to identify these buried MPs in complex samples. In this study, proteins and potential MPs were identified using Open-pFind and three other search engines to compare their performance and efficiency with three large-scale data sets digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity on different amino acid (AA) residues. Our results demonstrated that Open-pFind identified 44.7-93.1% more peptide-spectrum matches and 21.3-61.6% more peptide sequences than the second-best search engine. As a result, Open-pFind detected 53.1% more MP candidates than MaxQuant and 8.8% more candidate MPs than Proteome Discoverer. In total, 5 (PE2) of the 124 MP candidates identified by Open-pFind were verified with 2 or 3 unique peptides containing more than 9 AAs by using a spectrum theoretical prediction with pDeep and synthesized peptide matching with pBuild after spectrum quality analysis, isobaric post-translational modification, and single amino acid variant filtering. These five verified MPs can be saved as PE1 proteins. In addition, three other MP candidates were verified with two unique peptides (one peptide containing more than 9 AAs and the other containing only 8 AAs), which was slightly lower than the criteria listed by C-HPP and required additional verification information. More importantly, unexpected modifications were detected in these MPs. All MS data sets have been deposited into ProteomeXchange with the identifier PXD015759.


Assuntos
Bases de Dados de Proteínas , Software , Testículo/química , Humanos , Masculino , Espectrometria de Massas , Processamento de Proteína Pós-Traducional , Proteínas/análise , Proteínas/genética , Proteínas/metabolismo , Proteômica/métodos , Ferramenta de Busca
20.
J Proteome Res ; 18(12): 4215-4230, 2019 12 06.
Artigo em Inglês | MEDLINE | ID: mdl-31429579

RESUMO

One of the most complex organs in the human body is the testis, where spermatogenesis takes place. This physiological process involves thousands of genes and proteins that are activated and repressed, making testis the organ with the highest number of tissue-specific genes. However, the function of a large proportion of the corresponding proteins remains unknown and testis harbors many missing proteins (MPs), defined as products of protein-coding genes that lack experimental mass spectrometry evidence. Here, an integrated omics approach was used for exploring the cell type-specific protein expression of genes with an elevated expression in testis. By combining genome-wide transcriptomics analysis with immunohistochemistry, more than 500 proteins with distinct testicular protein expression patterns were identified, and these were selected for in-depth characterization of their in situ expression in eight different testicular cell types. The cell type-specific protein expression patterns allowed us to identify six distinct clusters of expression at different stages of spermatogenesis. The analysis highlighted numerous poorly characterized proteins in each of these clusters whose expression overlapped with that of known proteins involved in spermatogenesis, including 85 proteins with an unknown function and 60 proteins that previously have been classified as MPs. Furthermore, we were able to characterize the in situ distribution of several proteins that previously lacked spatial information and cell type-specific expression within the testis. The testis elevated expression levels both at the RNA and protein levels suggest that these proteins are related to testis-specific functions. In summary, the study demonstrates the power of combining genome-wide transcriptomics analysis with antibody-based protein profiling to explore the cell type-specific expression of both well-known proteins and MPs. The analyzed proteins constitute important targets for further testis-specific research in male reproductive disorders.


Assuntos
Proteínas/metabolismo , Testículo/citologia , Testículo/fisiologia , Anticorpos , Expressão Gênica , Humanos , Masculino , Proteínas/genética , Proteínas/imunologia , Espermatogênese , Transcriptoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA