Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 83
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Proteomics ; 24(1-2): e2200332, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-37876146

RESUMEN

This article summarizes the PROTREC method and investigates the impact that the different hyper-parameters have on the task of missing protein prediction using PROTREC. We evaluate missing protein recovery rates using different PROTREC score selection approaches (MAX, MIN, MEDIAN, and MEAN), different PROTREC score thresholds, as well as different complex size thresholds. In addition, we included two additional cancer datasets in our analysis and introduced a new validation method to check both the robustness of the PROTREC method as well as the correctness of our analysis. Our analysis showed that the missing protein recovery rate can be improved by adopting PROTREC score selection operations of MIN, MEDIAN, and MEAN instead of the default MAX. However, this may come at a cost of reduced numbers of proteins predicted and validated. The users should therefore choose their hyper-parameters carefully to find a balance in the accuracy-quantity trade-off. We also explored the possibility of combining PROTREC with a p-value-based method (FCS) and demonstrated that PROTREC is able to perform well independently without any help from a p-value-based method. Furthermore, we conducted a downstream enrichment analysis to understand the biological pathways and protein networks within the cancerous tissues using the recovered proteins. Missing protein recovery rate using PROTREC can be improved by selecting a different PROTREC score selection method. Different PROTREC score selection methods and other hyper-parameters such as PROTREC score threshold and complex size threshold introduce accuracy-quantity trade-off. PROTREC is able to perform well independently of any filtering using a p-value-based method. Verification of the PROTREC method on additional cancer datasets. Downstream Enrichment Analysis to understand the biological pathways and protein networks in cancerous tissues.


Asunto(s)
Algoritmos , Neoplasias , Humanos
2.
J Proteome Res ; 23(7): 2323-2331, 2024 Jul 05.
Artículo en Inglés | MEDLINE | ID: mdl-38865581

RESUMEN

The Chromosome-Centric Human Proteome Project (C-HPP) aims to identify all proteins encoded by the human genome. Currently, the human proteome still contains approximately 2000 PE2-PE5 proteins, referring to annotated coding genes that lack sufficient protein-level evidence. During the past 10 years, it has been increasingly difficult to identify PE2-PE5 proteins in C-HPP approaches due to the limited occurrence. Therefore, we proposed that reanalyzing massive MS data sets in repository with newly developed algorithms may increase the occurrence of the peptides of these proteins. In this study, we downloaded 1000 MS data sets via the ProteomeXchange database. Using pFind software, we identified peptides referring to 1788 PE2-PE5 proteins. Among them, 11 PE2 and 16 PE5 proteins were identified with at least 2 peptides, and 12 of them were identified using 2 peptides in a single data set, following the criteria of the HPP guidelines. We found translation evidence for 16 of the 11 PE2 and 16 PE5 proteins in our RNC-seq data, supporting their existence. The properties of the PE2 and PE5 proteins were similar to those of the PE1 proteins. Our approach demonstrated that mining PE2 and PE5 proteins in massive data repository is still worthy, and multidata set peptide identifications may support the presence of PE2 and PE5 proteins or at least prompt additional studies for validation. Extremely high throughput could be a solution to finding more PE2 and PE5 proteins.


Asunto(s)
Bases de Datos de Proteínas , Proteoma , Programas Informáticos , Humanos , Proteoma/análisis , Proteoma/genética , Algoritmos , Espectrometría de Masas/métodos , Proteómica/métodos , Péptidos/genética , Péptidos/análisis , Péptidos/química , Genoma Humano
3.
J Proteome Res ; 23(1): 238-248, 2024 01 05.
Artículo en Inglés | MEDLINE | ID: mdl-38085962

RESUMEN

Efforts to understand the complexities of human biology encompass multidimensional aspects, with proteins emerging as crucial components. However, studying the human ovary introduces unique challenges due to its complex dynamics and changes over a lifetime, varied cellular composition, and limited sample access. Here, four new RNA-seq samples of ovarian cortex spanning ages of 7 to 32 were sequenced and added to the existing data in the Human Protein Atlas (HPA) database www.proteinatlas.org, opening the doors to unique possibilities for exploration of oocyte-specific proteins. Based on transcriptomics analysis of the four new tissue samples representing both prepubertal girls and women of fertile age, we selected 20 protein candidates that lacked previous evidence at the protein level, so-called "missing proteins" (MPs). The proteins were validated using high-resolution antibody-based profiling and single-cell transcriptomics. Fourteen proteins exhibited consistent single-cell expression patterns in oocytes and granulosa cells, confirming their presence in the ovary and suggesting that these proteins play important roles in ovarian function, thus proposing that these 14 proteins should no longer be classified as MPs. This research significantly advances the understanding of MPs, unearthing fresh avenues for prospective exploration. By integrating innovative methodologies and leveraging the wealth of data in the HPA database, these insights contribute to refining our understanding of protein roles within the human ovary and opening the doors for further investigations into missing proteins and human reproduction.


Asunto(s)
Ovario , Proteómica , Humanos , Femenino , Estudios Prospectivos , Oocitos , Proteínas/metabolismo , Perfilación de la Expresión Génica
4.
J Proteome Res ; 2024 Aug 20.
Artículo en Inglés | MEDLINE | ID: mdl-39163279

RESUMEN

This Technical Note presents a comprehensive proteomics workflow for the new combination of Orbitrap and Astral mass analyzers across biofluids, cells, and tissues. Central to our workflow is the integration of Adaptive Focused Acoustics (AFA) technology for cells and tissue lysis to ensure robust and reproducible sample preparation in a high-throughput manner. Furthermore, we automated the detergent-compatible single-pot, solid-phase-enhanced sample Preparation (SP3) method for protein digestion. The synergy of these advanced methodologies facilitates a robust and high-throughput approach for cell and tissue analysis, an important consideration in translational research. This work disseminates our platform workflow, analyzes the effectiveness, demonstrates the reproducibility of the results, and highlights the potential of these technologies in biomarker discovery and disease pathology. For cells and tissues (heart, liver, lung, and intestine) proteomics analysis by data-independent acquisition mode, identifications exceeding 10,000 proteins can be achieved with a 24 min active gradient. In 200 ng injections of HeLa digest across multiple gradients, an average of more than 80% of proteins have a CV less than 20%, and a 45 min run covers ∼90% of the expressed proteome. This complete workflow allows for large swaths of the proteome to be identified and is compatible with diverse sample types.

5.
J Proteome Res ; 23(2): 532-549, 2024 02 02.
Artículo en Inglés | MEDLINE | ID: mdl-38232391

RESUMEN

Since 2010, the Human Proteome Project (HPP), the flagship initiative of the Human Proteome Organization (HUPO), has pursued two goals: (1) to credibly identify the protein parts list and (2) to make proteomics an integral part of multiomics studies of human health and disease. The HPP relies on international collaboration, data sharing, standardized reanalysis of MS data sets by PeptideAtlas and MassIVE-KB using HPP Guidelines for quality assurance, integration and curation of MS and non-MS protein data by neXtProt, plus extensive use of antibody profiling carried out by the Human Protein Atlas. According to the neXtProt release 2023-04-18, protein expression has now been credibly detected (PE1) for 18,397 of the 19,778 neXtProt predicted proteins coded in the human genome (93%). Of these PE1 proteins, 17,453 were detected with mass spectrometry (MS) in accordance with HPP Guidelines and 944 by a variety of non-MS methods. The number of neXtProt PE2, PE3, and PE4 missing proteins now stands at 1381. Achieving the unambiguous identification of 93% of predicted proteins encoded from across all chromosomes represents remarkable experimental progress on the Human Proteome parts list. Meanwhile, there are several categories of predicted proteins that have proved resistant to detection regardless of protein-based methods used. Additionally there are some PE1-4 proteins that probably should be reclassified to PE5, specifically 21 LINC entries and ∼30 HERV entries; these are being addressed in the present year. Applying proteomics in a wide array of biological and clinical studies ensures integration with other omics platforms as reported by the Biology and Disease-driven HPP teams and the antibody and pathology resource pillars. Current progress has positioned the HPP to transition to its Grand Challenge Project focused on determining the primary function(s) of every protein itself and in networks and pathways within the context of human health and disease.


Asunto(s)
Anticuerpos , Proteoma , Humanos , Proteoma/genética , Proteoma/análisis , Bases de Datos de Proteínas , Espectrometría de Masas/métodos , Proteómica/métodos
6.
J Proteome Res ; 22(4): 1148-1158, 2023 04 07.
Artículo en Inglés | MEDLINE | ID: mdl-36445260

RESUMEN

The Chromosome-centric Human Proteome Project (C-HPP) aims at identifying the proteins as gene products encoded by the human genome, characterizing their isoforms and functions. The existence of products has now been confirmed for 93.2% of the genes at the protein level. The remaining mostly correspond to proteins of low abundance or difficult to access. Over the past years, we have significantly contributed to the identification of missing proteins in the human spermatozoa. We pursue our search in the reproductive sphere with a focus on early human embryonic development. Pluripotent cells, developing into the fetus, and trophoblast cells, giving rise to the placenta, emerge during the first weeks. This emergence is a focus of scientists working in the field of reproduction, placentation and regenerative medicine. Most knowledge has been harnessed by transcriptomic analysis. Interestingly, some genes are uniquely expressed in those cells, giving the opportunity to uncover new proteins that might play a crucial role in setting up the molecular events underlying early embryonic development. Here, we analyzed naive pluripotent and trophoblastic stem cells and discovered 4 new missing proteins, thus contributing to the C-HPP. The mass spectrometry proteomics data was deposited on ProteomeXchange under the data set identifier PXD035768.


Asunto(s)
Proteoma , Trofoblastos , Masculino , Humanos , Proteoma/genética , Proteoma/análisis , Espectrometría de Masas , Cromosomas/química , Línea Celular
7.
J Proteome Res ; 22(4): 1071-1079, 2023 04 07.
Artículo en Inglés | MEDLINE | ID: mdl-36108145

RESUMEN

In the quest for "missing proteins" (MPs), the proteins encoded by the human genome still lacking evidence of existence at the protein level, novel approaches are needed to detect this challenging group of proteins. The current count stands at 1,343 MPs, and it is likely that many of these proteins are expressed at low levels, in rare cell or tissue types, or the cells in which they are expressed may only represent a small minority of the tissue. Here, we used an integrated omics approach to identify and explore MPs in human ovaries. By taking advantage of publicly available transcriptomics and antibody-based proteomics data in the Human Protein Atlas (HPA), we selected 18 candidates for further immunohistochemical analysis using an exclusive collection of ovarian tissues from women and patients of reproductive age. The results were compared with data from single-cell mRNA sequencing, and seven proteins (CTXN1, MRO, RERGL, TTLL3, TRIM61, TRIM73, and ZNF793) could be validated at the single-cell type level with both methods. We present for the first time the cell type-specific spatial localization of 18 MPs in human ovarian follicles, thereby showcasing the utility of the HPA database as an important resource for identification of MPs suitable for exploration in specialized tissue samples. The results constitute a starting point for further quantitative and qualitative analysis of the human ovaries, and the novel data for the seven proteins that were validated with both methods should be considered as evidence of existence of these proteins in human ovary.


Asunto(s)
Ovario , Proteómica , Humanos , Femenino , Ovario/química , Proteómica/métodos , Proteínas/metabolismo , Anticuerpos/metabolismo , Perfilación de la Expresión Génica , Proteoma/genética , Proteoma/análisis
8.
J Proteome Res ; 22(4): 1024-1042, 2023 04 07.
Artículo en Inglés | MEDLINE | ID: mdl-36318223

RESUMEN

The 2022 Metrics of the Human Proteome from the HUPO Human Proteome Project (HPP) show that protein expression has now been credibly detected (neXtProt PE1 level) for 18 407 (93.2%) of the 19 750 predicted proteins coded in the human genome, a net gain of 50 since 2021 from data sets generated around the world and reanalyzed by the HPP. Conversely, the number of neXtProt PE2, PE3, and PE4 missing proteins has been reduced by 78 from 1421 to 1343. This represents continuing experimental progress on the human proteome parts list across all the chromosomes, as well as significant reclassifications. Meanwhile, applying proteomics in a vast array of biological and clinical studies continues to yield significant findings and growing integration with other omics platforms. We present highlights from the Chromosome-Centric HPP, Biology and Disease-driven HPP, and HPP Resource Pillars, compare features of mass spectrometry and Olink and Somalogic platforms, note the emergence of translation products from ribosome profiling of small open reading frames, and discuss the launch of the initial HPP Grand Challenge Project, "A Function for Each Protein".


Asunto(s)
Proteoma , Proteómica , Humanos , Proteoma/genética , Proteoma/análisis , Bases de Datos de Proteínas , Espectrometría de Masas/métodos , Sistemas de Lectura Abierta , Proteómica/métodos
9.
Mol Cell Proteomics ; 20: 100062, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33640492

RESUMEN

We celebrate the 10th anniversary of the launch of the HUPO Human Proteome Project (HPP) and its major milestone of confident detection of at least one protein from each of 90% of the predicted protein-coding genes, based on the output of the entire proteomics community. The Human Genome Project reached a similar decadal milestone 20 years ago. The HPP has engaged proteomics teams around the world, strongly influenced data-sharing, enhanced quality assurance, and issued stringent guidelines for claims of detecting previously "missing proteins." This invited perspective complements papers on "A High-Stringency Blueprint of the Human Proteome" and "The Human Proteome Reaches a Major Milestone" in special issues of Nature Communications and Journal of Proteome Research, respectively, released in conjunction with the October 2020 virtual HUPO Congress and its celebration of the 10th anniversary of the HUPO HPP.


Asunto(s)
Proteoma , Sociedades Científicas/historia , Exactitud de los Datos , Historia del Siglo XXI , Humanos , Difusión de la Información
10.
J Proteome Res ; 20(12): 5329-5339, 2021 12 03.
Artículo en Inglés | MEDLINE | ID: mdl-34748338

RESUMEN

With the steadfast development of proteomic technology, the number of missing proteins (MPs) has been continuously shrinking, with approximately 1470 MPs that have not been explored yet. Due to this phenomenon, the discovery of MPs has been increasingly more difficult and elusive. In order to face this challenge, we have hypothesized that a stable aneuploid cell line with increased chromosomes serves as a useful material for assisting MP exploration. Ker-CT cell line with trisomy at chromosome 5 and 20 was selected for this purpose. With a combination strategy of RNA-Seq and LC-MS/MS, a total of 22 178 transcripts and 8846 proteins were identified in Ker-CT. Although the transcripts corresponding to 15 and 15 MP genes located at chromosome 5 and 20 were detected, none of the MPs were found in Ker-CT. Surprisingly, 3 MPs containing at least two unique non-nest peptides of length ≥9 amino acids were identified in Ker-CT, whose genes are located on chromosome 3 and 10, respectively. Furthermore, the 3 MPs were verified using the method of parallel reaction monitoring (PRM). These results suggest that the abnormal status of chromosomes may not only impact the expression of the corresponding genes in trisomy chromosomes, but also influence that of other chromosomes, which benefits MP discovery. The data obtained in this study are available via ProteomeXchange (PXD028647) and PeptideAtlas (PASS01700), respectively.


Asunto(s)
Proteogenómica , Proteómica , Aneuploidia , Línea Celular , Cromatografía Liquida , Humanos , Proteómica/métodos , Espectrometría de Masas en Tándem/métodos
11.
J Proteome Res ; 20(12): 5227-5240, 2021 12 03.
Artículo en Inglés | MEDLINE | ID: mdl-34670092

RESUMEN

The 2021 Metrics of the HUPO Human Proteome Project (HPP) show that protein expression has now been credibly detected (neXtProt PE1 level) for 18 357 (92.8%) of the 19 778 predicted proteins coded in the human genome, a gain of 483 since 2020 from reports throughout the world reanalyzed by the HPP. Conversely, the number of neXtProt PE2, PE3, and PE4 missing proteins has been reduced by 478 to 1421. This represents remarkable progress on the proteome parts list. The utilization of proteomics in a broad array of biological and clinical studies likewise continues to expand with many important findings and effective integration with other omics platforms. We present highlights from the Immunopeptidomics, Glycoproteomics, Infectious Disease, Cardiovascular, Musculo-Skeletal, Liver, and Cancers B/D-HPP teams and from the Knowledgebase, Mass Spectrometry, Antibody Profiling, and Pathology resource pillars, as well as ethical considerations important to the clinical utilization of proteomics and protein biomarkers.


Asunto(s)
Benchmarking , Proteoma , Bases de Datos de Proteínas , Humanos , Espectrometría de Masas/métodos , Proteoma/análisis , Proteoma/genética , Proteómica/métodos
12.
J Proteome Res ; 19(12): 4808-4814, 2020 12 04.
Artículo en Inglés | MEDLINE | ID: mdl-33172275

RESUMEN

The Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2012 to perfect the annotation of human protein existence by identifying stronger evidence of the expression of missing proteins (MPs) at the protein level. After an 8 year effort all over the world, the number of MPs in the neXtProt database significantly decreased from 5511 (2012-02-24) to 1899 (2020-01-17). It is now more difficult to provide confident evidence of the remaining MPs because of their specific characteristics, including low abundance, low molecular weight, unexpected modifications, transmembrane structure, tissue-expression specificity, and so on. A higher resolution mass spectrometry (MS) interpretation engine might provide an opportunity to identify these buried MPs in complex samples by the combination with multi-tissue large-scale proteomics. In this study, open-pFind was used to dig MPs from 20 pairs of healthy human tissues by Wang et al. ( Mol. Syst. Biol. 2019, 15 (2), e8503) combined with our large-scale testis data set digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity for different amino acid residues ( J. Proteme Res. 2019, 18 (12), 4189-4196). A total of 1 535 536 peptides with 17 283 477 peptide-spectrum matches (PSMs) were mapped to 14 279 protein entries at a false discovery rate of <1% at the PSM, peptide, and protein levels. A total of 103 MP candidates were identified, among which 86 candidates had more unique peptide numbers compared with our single testis tissue. After rigorous screening, manual checks, peptide synthesis, and matching with documented peptides from PeptideAtlas, we validated four MPs, P0C7T8 (duodenum and small intestine), Q8WWZ4 (stomach and rectum), Q8IV35 (fallopian tube), and O14921 (tonsil), at the protein level. All MS raw files have been deposited to the ProteomeXchange with identifier PXD021391.


Asunto(s)
Proteoma , Proteómica , Femenino , Humanos , Masculino , Espectrometría de Masas , Peso Molecular , Péptidos
13.
J Proteome Res ; 19(1): 401-408, 2020 01 03.
Artículo en Inglés | MEDLINE | ID: mdl-31773964

RESUMEN

The mission of the Chromosome-Centric Human Proteome Project (C-HPP) to discover missing proteins (MPs) has become increasingly difficult due to the remaining low-abundance, high-hydrophobicity, or low-molecular-weight MPs. We have reported two approaches to resolve these identification problems for the low-abundance and high-hydrophobicity MPs, respectively. In this study, to improve the identification of low-abundance MPs with high hydrophobicity, we combined two approaches and obtained MPs from several different cancer cell lines. Their membrane fractions were isolated by ultracentrifugation, and the low-abundance proteins were enriched at the protein level with the ProteoMiner kit. After that, the peptides from the enriched proteins were separated by high concentrations of organic solvents according to their hydrophobicity as the first dimension of separation at the peptide level, and the second and third dimensions of separation involved a high pH reversed-phase and an acid reversed-phase column, respectively. In total, 16 MPs (at least two non-nested unique peptides with ≥9 amino acids) with 61 unique peptides were identified from four human cancer cell lines, including 2, 8, 2, and 7 MPs from HeLa, HCT116, SNU-1, and HepG2 cells, respectively. Furthermore, all MPs were verified with two non-nested unique peptides through parallel reaction monitoring (PRM) by matching the peptides with their chemically synthesized peptides. Interestingly, two additional MPs were verified from the same cell line by PRM assay, although the two non-nested unique peptides with ≥9 amino acids for each MP were identified from different MS injections or cell lines by data-dependent acquisition (DDA). Thus, a total of 18 MPs were dug out in this study. The data are available via ProteomeXchange (PXD014058) and PeptideAtlas (PASS01388).


Asunto(s)
Proteínas/análisis , Proteínas/química , Proteómica/métodos , Línea Celular Tumoral , Electroforesis en Gel de Poliacrilamida , Humanos , Interacciones Hidrofóbicas e Hidrofílicas , Espectrometría de Masas/métodos
14.
J Proteome Res ; 19(12): 4857-4866, 2020 12 04.
Artículo en Inglés | MEDLINE | ID: mdl-33210925

RESUMEN

Since the Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2010, many techniques have been adopted for the discovery of missing proteins (MPs). Because of these efforts, only 1481 MPs remained as of July 2020; however, by relying only on technique optimization, researchers have reached a bottleneck in MP discovery. Protein expression is tissue- or cell-type-dependent. The tissues of the human testis and brain have been reported to harbor a large number of tissue-specific genes and proteins; however, few studies have been performed on human brain tissue or cells to identify MPs. Herein a metastatic cell line derived from brain cancer, D283 Med, was used to search for MPs. With a traditional and simple shotgun workflow to separate the peptides into 20 fractions, 12 MPs containing at least two unique non-nested peptides (amino acid length ≥9) were identified in this cell line with a protein false discovery rate of <1%. Following the same experimental protocol, only one MP was found in a nonmetastatic brain cancer cell line, U-118 MG. Furthermore, 12 MPs were verified as having two non-nested unique peptides by matching them with corresponding chemically synthesized peptides through parallel reaction monitoring. These results clearly demonstrate that the appropriate selection of experimental materials, either tissues or cell lines, is imperative for MP discovery. The data obtained in this study are available via ProteomeXchange (PXD021482) and PeptideAtlas (PASS01627).


Asunto(s)
Neoplasias Cerebelosas , Meduloblastoma , Línea Celular , Humanos , Masculino , Meduloblastoma/genética , Péptidos , Proteómica
15.
J Proteome Res ; 19(12): 4766-4781, 2020 12 04.
Artículo en Inglés | MEDLINE | ID: mdl-33170010

RESUMEN

The localization of proteins at a tissue- or cell-type-specific level is tightly linked to the protein function. To better understand each protein's role in cellular systems, spatial information constitutes an important complement to quantitative data. The standard methods for determining the spatial distribution of proteins in single cells of complex tissue samples make use of antibodies. For a stringent analysis of the human proteome, we used orthogonal methods and independent antibodies to validate 5981 antibodies that show the expression of 3775 human proteins across all major human tissues. This enhanced validation uncovered 56 proteins corresponding to the group of "missing proteins" and 171 proteins of unknown function. The presented strategy will facilitate further discussions around criteria for evidence of protein existence based on immunohistochemistry and serves as a useful guide to identify candidate proteins for integrative studies with quantitative proteomics methods.


Asunto(s)
Proteoma , Proteómica , Anticuerpos , Humanos , Inmunohistoquímica
16.
J Proteome Res ; 19(12): 4747-4753, 2020 12 04.
Artículo en Inglés | MEDLINE | ID: mdl-33124832

RESUMEN

The Chromosome-Centric Human Proteome Project (C-HPP) aims at the identification of missing proteins (MPs) and the functional characterization of functionally unannotated PE1 (uPE1) proteins. A major challenge in addressing this goal is that many human proteins and MPs are silent in adult cells. A promising approach to overcome such challenge is to exploit the advantage of novel tools such as pluripotent stem cells (PSCs), which are capable of differentiation into three embryonic germ layers, namely, the endoderm, mesoderm, and ectoderm. Here we present several examples of how the Human Y Chromosome Proteome Project (Y-HPP) benefited from this approach to meet C-HPP goals. Furthermore, we discuss how integrating CRISPR engineering, human-induced pluripotent stem cell (hiPSC)-derived disease modeling systems, and organoid technologies provides a unique platform for Y-HPP and C-HPP for MP identification and the functional characterization of human proteins, especially uPE1s.


Asunto(s)
Células Madre Pluripotentes , Proteoma , Diferenciación Celular , Cromosomas Humanos Y , Humanos , Proteoma/genética
17.
J Proteome Res ; 19(12): 4735-4746, 2020 12 04.
Artículo en Inglés | MEDLINE | ID: mdl-32931287

RESUMEN

According to the 2020 Metrics of the HUPO Human Proteome Project (HPP), expression has now been detected at the protein level for >90% of the 19 773 predicted proteins coded in the human genome. The HPP annually reports on progress made throughout the world toward credibly identifying and characterizing the complete human protein parts list and promoting proteomics as an integral part of multiomics studies in medicine and the life sciences. NeXtProt release 2020-01 classified 17 874 proteins as PE1, having strong protein-level evidence, up 180 from 17 694 one year earlier. These represent 90.4% of the 19 773 predicted coding genes (all PE1,2,3,4 proteins in neXtProt). Conversely, the number of neXtProt PE2,3,4 proteins, termed the "missing proteins" (MPs), was reduced by 230 from 2129 to 1899 since the neXtProt 2019-01 release. PeptideAtlas is the primary source of uniform reanalysis of raw mass spectrometry data for neXtProt, supplemented this year with extensive data from MassIVE. PeptideAtlas 2020-01 added 362 canonical proteins between 2019 and 2020 and MassIVE contributed 84 more, many of which converted PE1 entries based on non-MS evidence to the MS-based subgroup. The 19 Biology and Disease-driven B/D-HPP teams continue to pursue the identification of driver proteins that underlie disease states, the characterization of regulatory mechanisms controlling the functions of these proteins, their proteoforms, and their interactions, and the progression of transitions from correlation to coexpression to causal networks after system perturbations. And the Human Protein Atlas published Blood, Brain, and Metabolic Atlases.


Asunto(s)
Proteoma , Proteómica , Bases de Datos de Proteínas , Genoma Humano , Humanos , Espectrometría de Masas , Proteoma/genética
18.
Cell Biol Toxicol ; 36(3): 261-272, 2020 06.
Artículo en Inglés | MEDLINE | ID: mdl-31599373

RESUMEN

In the advanced stages, malignant melanoma (MM) has a very poor prognosis. Due to tremendous efforts in cancer research over the last 10 years, and the introduction of novel therapies such as targeted therapies and immunomodulators, the rather dark horizon of the median survival has dramatically changed from under 1 year to several years. With the advent of proteomics, deep-mining studies can reach low-abundant expression levels. The complexity of the proteome, however, still surpasses the dynamic range capabilities of current analytical techniques. Consequently, many predicted protein products with potential biological functions have not yet been verified in experimental proteomic data. This category of 'missing proteins' (MP) is comprised of all proteins that have been predicted but are currently unverified. As part of the initiative launched in 2016 in the USA, the European Cancer Moonshot Center has performed numerous deep proteomics analyses on samples from MM patients. In this study, nine MPs were clearly identified by mass spectrometry in MM metastases. Some MPs significantly correlated with proteins that possess identical PFAM structural domains; and other MPs were significantly associated with cancer-related proteins. This is the first study to our knowledge, where unknown and novel proteins have been annotated in metastatic melanoma tumour tissue.


Asunto(s)
Melanoma/genética , Metástasis de la Neoplasia/genética , Proteómica/métodos , Adulto , Biomarcadores de Tumor/genética , Femenino , Genoma Humano/genética , Humanos , Masculino , Persona de Mediana Edad , Anotación de Secuencia Molecular/métodos , Anotación de Secuencia Molecular/tendencias , Pronóstico , Proteoma/genética , Proteoma/metabolismo , Neoplasias Cutáneas/genética , Melanoma Cutáneo Maligno
19.
J Proteome Res ; 18(12): 4197-4205, 2019 12 06.
Artículo en Inglés | MEDLINE | ID: mdl-31646870

RESUMEN

The Human Proteome Project (HPP) has made great efforts to clarify the existing evidence of human proteins since 2012. However, according to the recent release of neXtProt (2019-1), approximately 10% of all human genes still have inadequate or no experimental evidence of their translation at the protein level. They were categorized as missing proteins (PE2-PE4). To further the goal of HPP, we developed a two-step bioinformatic strategy addressing the utilization of the SRMAtlas synthetic peptides corresponding to the missing proteins as an exclusive reference in order to explore their natural counterparts within GPM. In the first step, we searched the GPM for the non-nested SRMAtlas peptides corresponding to the missing proteins, taking under consideration only those detected via ≥2 non-nested unitypic/proteotypic peptides "Stranded peptides" with length ≥9 amino acids in the same proteomic study. As a result, 51 missing proteins were newly detected in 35 different proteomic studies. In the second step, we validated these newly detected missing proteins based on matching the spectra of their synthetic and natural peptides in SRMAtlas and GPM, respectively. The results showed that 23 of the missing proteins with ≥2 non-nested peptides were validated by careful spectral matching.


Asunto(s)
Bases de Datos de Proteínas , Proteoma , Proteómica/métodos , Humanos , Péptidos/síntesis química , Mapas de Interacción de Proteínas , Reproducibilidad de los Resultados , Programas Informáticos
20.
J Proteome Res ; 18(12): 4189-4196, 2019 12 06.
Artículo en Inglés | MEDLINE | ID: mdl-31657219

RESUMEN

In recent years, high-throughput technologies have contributed to the development of a more precise picture of the human proteome. However, 2129 proteins remain listed as missing proteins (MPs) in the newest neXtProt release (2019-02). The main reasons for MPs are a low abundance, a low molecular weight, unexpected modifications, membrane characteristics, and so on. Moreover, >50% of the MS/MS data have not been successfully identified in shotgun proteomics. Open-pFind, an efficient open search engine, recently released by the pFind group in China, might provide an opportunity to identify these buried MPs in complex samples. In this study, proteins and potential MPs were identified using Open-pFind and three other search engines to compare their performance and efficiency with three large-scale data sets digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity on different amino acid (AA) residues. Our results demonstrated that Open-pFind identified 44.7-93.1% more peptide-spectrum matches and 21.3-61.6% more peptide sequences than the second-best search engine. As a result, Open-pFind detected 53.1% more MP candidates than MaxQuant and 8.8% more candidate MPs than Proteome Discoverer. In total, 5 (PE2) of the 124 MP candidates identified by Open-pFind were verified with 2 or 3 unique peptides containing more than 9 AAs by using a spectrum theoretical prediction with pDeep and synthesized peptide matching with pBuild after spectrum quality analysis, isobaric post-translational modification, and single amino acid variant filtering. These five verified MPs can be saved as PE1 proteins. In addition, three other MP candidates were verified with two unique peptides (one peptide containing more than 9 AAs and the other containing only 8 AAs), which was slightly lower than the criteria listed by C-HPP and required additional verification information. More importantly, unexpected modifications were detected in these MPs. All MS data sets have been deposited into ProteomeXchange with the identifier PXD015759.


Asunto(s)
Bases de Datos de Proteínas , Programas Informáticos , Testículo/química , Humanos , Masculino , Espectrometría de Masas , Procesamiento Proteico-Postraduccional , Proteínas/análisis , Proteínas/genética , Proteínas/metabolismo , Proteómica/métodos , Motor de Búsqueda
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA