Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 43
Filter
1.
Biochemistry (Mosc) ; 89(8): 1474-1489, 2024 Aug.
Article in English | MEDLINE | ID: mdl-39245456

ABSTRACT

Expansion of CAG repeats in certain genes is a known cause of several neurodegenerative diseases, but exact mechanism behind this is not yet fully understood. It is believed that the double-stranded RNA regions formed by CAG repeats could be harmful to the cell. This study aimed to test the hypothesis that these RNA regions might potentially interfere with ADAR RNA editing enzymes, leading to the reduced A-to-I editing of RNA and activation of the interferon response. We studied induced pluripotent stem cells (iPSCs) derived from the patients with Huntington's disease or ataxia type 17, as well as midbrain organoids developed from these cells. A targeted panel for next-generation sequencing was used to assess editing in the specific RNA regions. Differentiation of iPSCs into brain organoids led to increase in the ADAR2 gene expression and decrease in the expression of protein inhibitors of RNA editing. As a result, there was increase in the editing of specific ADAR2 substrates, which allowed identification of differential substrates of ADAR isoforms. However, comparison of the pathology and control groups did not show differences in the editing levels among the iPSCs. Additionally, brain organoids with 42-46 CAG repeats did not exhibit global changes. On the other hand, brain organoids with the highest number of CAG repeats in the huntingtin gene (76) showed significant decrease in the level of RNA editing of specific transcripts, potentially involving ADAR1. Notably, editing of the long non-coding RNA PWAR5 was nearly absent in this sample. It could be stated in conclusion that in most cultures with repeat expansion, the hypothesized effect on RNA editing was not confirmed.


Subject(s)
Adenosine Deaminase , Brain , Cell Differentiation , Huntington Disease , Induced Pluripotent Stem Cells , Organoids , RNA Editing , RNA-Binding Proteins , Adenosine Deaminase/metabolism , Adenosine Deaminase/genetics , Humans , Organoids/metabolism , Induced Pluripotent Stem Cells/metabolism , Induced Pluripotent Stem Cells/cytology , RNA-Binding Proteins/metabolism , RNA-Binding Proteins/genetics , Brain/metabolism , Huntington Disease/genetics , Huntington Disease/metabolism , Huntington Disease/pathology , Trinucleotide Repeat Expansion
2.
J Proteome Res ; 22(6): 1695-1711, 2023 06 02.
Article in English | MEDLINE | ID: mdl-37158322

ABSTRACT

The proteogenomic search pipeline developed in this work has been applied for reanalysis of 40 publicly available shotgun proteomic datasets from various human tissues comprising more than 8000 individual LC-MS/MS runs, of which 5442 .raw data files were processed in total. This reanalysis was focused on searching for ADAR-mediated RNA editing events, their clustering across samples of different origins, and classification. In total, 33 recoded protein sites were identified in 21 datasets. Of those, 18 sites were detected in at least two datasets, representing the core human protein editome. In agreement with prior artworks, neural and cancer tissues were found to be enriched with recoded proteins. Quantitative analysis indicated that recoding the rate of specific sites did not directly depend on the levels of ADAR enzymes or targeted proteins themselves, rather it was governed by differential and yet undescribed regulation of interaction of enzymes with mRNA. Nine recoding sites conservative between humans and rodents were validated by targeted proteomics using stable isotope standards in the murine brain cortex and cerebellum, and an additional one was validated in human cerebrospinal fluid. In addition to previous data of the same type from cancer proteomes, we provide a comprehensive catalog of recoding events caused by ADAR RNA editing in the human proteome.


Subject(s)
Proteogenomics , Proteomics , Humans , Animals , Mice , RNA/metabolism , RNA Editing , Chromatography, Liquid , Tandem Mass Spectrometry , Proteome/genetics , Proteome/metabolism , Adenosine/metabolism , Inosine/genetics , Inosine/metabolism
3.
Int J Mol Sci ; 24(3)2023 Jan 27.
Article in English | MEDLINE | ID: mdl-36768787

ABSTRACT

Alternative splicing is one of the main regulation pathways in living cells beyond simple changes in the level of protein expression. Most of the approaches proposed in proteomics for the identification of specific splicing isoforms require a preliminary deep transcriptomic analysis of the sample under study, which is not always available, especially in the case of the re-analysis of previously acquired data. Herein, we developed new algorithms for the identification and validation of protein splice isoforms in proteomic data in the absence of RNA sequencing of the samples under study. The bioinformatic approaches were tested on the results of proteome analysis of human melanoma cell lines, obtained earlier by high-resolution liquid chromatography and mass spectrometry (LC-MS). A search for alternative splicing events for each of the cell lines studied was performed against the database generated from all known transcripts (RefSeq) and the one composed of peptide sequences, which included all biologically possible combinations of exons. The identifications were filtered using the prediction of both retention times and relative intensities of fragment ions in the corresponding mass spectra. The fragmentation mass spectra corresponding to the discovered alternative splicing events were additionally examined for artifacts. Selected splicing events were further validated at the mRNA level by quantitative PCR.


Subject(s)
Alternative Splicing , Melanoma , Humans , Alternative Splicing/genetics , Proteome/genetics , Proteome/metabolism , Proteomics/methods , RNA/metabolism , Protein Isoforms/genetics , Protein Isoforms/metabolism , Sequence Analysis, RNA , RNA Splicing , Cell Line , Melanoma/genetics
4.
Int J Mol Sci ; 24(1)2023 Jan 01.
Article in English | MEDLINE | ID: mdl-36614211

ABSTRACT

A meta-analysis of the results of targeted quantitative screening of human blood plasma was performed to generate a reference standard kit that can be used for health analytics. The panel included 53 of the 296 proteins that form a "stable" part of the proteome of a healthy individual; these proteins were found in at least 70% of samples and were characterized by an interindividual coefficient of variation <40%. The concentration range of the selected proteins was 10−10−10−3 M and enrichment analysis revealed their association with rare familial diseases. The concentration of ceruloplasmin was reduced by approximately three orders of magnitude in patients with neurological disorders compared to healthy volunteers, and those of gelsolin isoform 1 and complement factor H were abruptly reduced in patients with lung adenocarcinoma. Absolute quantitative data of the individual proteome of a healthy and diseased individual can be used as the basis for personalized medicine and health monitoring. Storage over time allows us to identify individual biomarkers in the molecular landscape and prevent pathological conditions.


Subject(s)
Blood Proteins , Plasma , Proteome , Humans , Blood Proteins/metabolism , Ceruloplasmin/metabolism , Mass Spectrometry/methods , Plasma/metabolism , Proteomics
5.
J Proteome Res ; 21(6): 1438-1448, 2022 06 03.
Article in English | MEDLINE | ID: mdl-35536917

ABSTRACT

Mass spectrometry-based proteome analysis implies matching the mass spectra of proteolytic peptides to amino acid sequences predicted from genomic sequences. Reliability of peptide variant identification in proteogenomic studies is often lacking. We propose a way to interpret shotgun proteomics results, specifically in the data-dependent acquisition mode, as protein sequence coverage by multiple reads as it is done in nucleic acid sequencing for calling of single nucleotide variants. Multiple reads for each sequence position could be provided by overlapping distinct peptides, thus confirming the presence of certain amino acid residues in the overlapping stretch with a lower false discovery rate. Overlapping distinct peptides originate from miscleaved tryptic peptides in combination with their properly cleaved counterparts and from peptides generated by multiple proteases after the same specimen is subject to parallel digestion and analyzed separately. We illustrate this approach using publicly available multiprotease data sets and our own data generated for the HEK-293 cell line digests obtained using trypsin, LysC, and GluC proteases. Totally, up to 30% of the whole proteome was covered by tryptic peptides with up to 7% covered twofold and more. The proteogenomic analysis of the HEK-293 cell line revealed 36 single amino acid variants, seven of which were supported by multiple reads.


Subject(s)
Proteogenomics , Amino Acids , HEK293 Cells , Humans , Peptide Hydrolases , Peptides/analysis , Proteogenomics/methods , Proteome/analysis , Reproducibility of Results
6.
Proc Biol Sci ; 289(1980): 20221152, 2022 08 10.
Article in English | MEDLINE | ID: mdl-35946162

ABSTRACT

Venoms of predatory marine cone snails are intensely studied because of the biomedical applications of the neuropeptides that they contain, termed conotoxins. Meanwhile some gastropod lineages have independently acquired secretory glands strikingly similar to the venom gland of cone snails, suggesting that they possess similar venoms. Here we focus on the most diversified of these clades, the genus Vexillum. Based on the analysis of a multi-species proteo-transcriptomic dataset, we show that Vexillum species indeed produce complex venoms dominated by highly diversified short cysteine-rich peptides, vexitoxins. Vexitoxins possess the same precursor organization, display overlapping cysteine frameworks and share several common post-translational modifications with conotoxins. Some vexitoxins show sequence similarity to conotoxins and adopt similar domain conformations, including a pharmacologically relevant inhibitory cysteine knot motif. The Vexillum envenomation gland (gL) is a notably more recent evolutionary novelty than the conoidean venom gland. Thus, we hypothesize lower divergence between vexitoxin genes, and their ancestral 'somatic' counterparts compared to that in conotoxins, and we find support for this hypothesis in the evolution of the vexitoxin cluster V027. We use this example to discuss how future studies on vexitoxins can inform the origin of conotoxins, and how they may help to address outstanding questions in venom evolution.


Subject(s)
Conotoxins , Conus Snail , Animals , Conotoxins/genetics , Conus Snail/chemistry , Conus Snail/genetics , Cysteine , Peptides/chemistry , Snails , Venoms
7.
Biochemistry (Mosc) ; 87(11): 1301-1309, 2022 Nov.
Article in English | MEDLINE | ID: mdl-36509721

ABSTRACT

RNA editing by adenosine deaminases of the ADAR family can lead to protein recoding, since inosine formed from adenosine in mRNA is complementary to cytosine; the resulting codon editing might introduce amino acid substitutions into translated proteins. Proteome recoding can have functional consequences which have been described in many animals including humans. Using protein recoding database derived from publicly available transcriptome data, we identified for the first time the recoding sites in the zebrafish shotgun proteomes. Out of more than a hundred predicted recoding events, ten substitutions were found in six used datasets. Seven of them were in the AMPA glutamate receptor subunits, whose recoding has been well described, and are conserved among vertebrates. Three sites were specific for zebrafish proteins and were found in the transmembrane receptors astrotactin 1 and neuregulin 3b (proteins involved in the neuronal adhesion and signaling) and in the rims2b gene product (presynaptic membrane protein participating in the neurotransmitter release), respectively. Further studies are needed to elucidate the role of recoding of the said three proteins in the zebrafish.


Subject(s)
Proteomics , Zebrafish , Animals , Humans , Zebrafish/genetics , Zebrafish/metabolism , Proteomics/methods , Zebrafish Proteins/genetics , Adenosine Deaminase/genetics , Adenosine Deaminase/metabolism , Proteome/metabolism , Adenosine/metabolism , RNA, Messenger/genetics
8.
Int J Mol Sci ; 23(9)2022 May 08.
Article in English | MEDLINE | ID: mdl-35563631

ABSTRACT

Adenosine-to-inosine RNA editing is a system of post-transcriptional modification widely distributed in metazoans which is catalyzed by ADAR enzymes and occurs mostly in double-stranded RNA (dsRNA) before splicing. This type of RNA editing changes the genetic code, as inosine generally pairs with cytosine in contrast to adenosine, and this expectably modulates RNA splicing. We review the interconnections between RNA editing and splicing in the context of human cancer. The editing of transcripts may have various effects on splicing, and resultant alternatively spliced isoforms may be either tumor-suppressive or oncogenic. Dysregulated RNA splicing in cancer often causes the release of excess amounts of dsRNA into cytosol, where specific dsRNA sensors provoke antiviral-like responses, including type I interferon signaling. These responses may arrest cell division, causing apoptosis and, externally, stimulate antitumor immunity. Thus, small-molecule spliceosome inhibitors have been shown to facilitate the antiviral-like signaling and are considered to be potential cancer therapies. In turn, a cytoplasmic isoform of ADAR can deaminate dsRNA in cytosol, thereby decreasing its levels and diminishing antitumor innate immunity. We propose that complete or partial inhibition of ADAR may enhance the proapoptotic and cytotoxic effects of splicing inhibitors and that it may be considered a promising addition to cancer therapies targeting RNA splicing.


Subject(s)
Adenosine Deaminase , Neoplasms , Adenosine/metabolism , Adenosine Deaminase/genetics , Adenosine Deaminase/metabolism , Antiviral Agents , Humans , Inosine/metabolism , Neoplasms/drug therapy , Neoplasms/genetics , RNA Splicing , RNA, Double-Stranded/genetics , RNA-Binding Proteins/metabolism
9.
Int J Mol Sci ; 23(9)2022 May 08.
Article in English | MEDLINE | ID: mdl-35563635

ABSTRACT

Cancer cell lines responded differentially to type I interferon treatment in models of oncolytic therapy using vesicular stomatitis virus (VSV). Two opposite cases were considered in this study, glioblastoma DBTRG-05MG and osteosarcoma HOS cell lines exhibiting resistance and sensitivity to VSV after the treatment, respectively. Type I interferon responses were compared for these cell lines by integrative analysis of the transcriptome, proteome, and RNA editome to identify molecular factors determining differential effects observed. Adenosine-to-inosine RNA editing was equally induced in both cell lines. However, transcriptome analysis showed that the number of differentially expressed genes was much higher in DBTRG-05MG with a specific enrichment in inflammatory proteins. Further, it was found that two genes, EGFR and HER2, were overexpressed in HOS cells compared with DBTRG-05MG, supporting recent reports that EGF receptor signaling attenuates interferon responses via HER2 co-receptor activity. Accordingly, combined treatment of cells with EGF receptor inhibitors such as gefitinib and type I interferon increases the resistance of sensitive cell lines to VSV. Moreover, sensitive cell lines had increased levels of HER2 protein compared with non-sensitive DBTRG-05MG. Presumably, the level of this protein expression in tumor cells might be a predictive biomarker of their resistance to oncolytic viral therapy.


Subject(s)
Interferon Type I , Oncolytic Virotherapy , Oncolytic Viruses , Vesicular Stomatitis , Animals , Cell Line, Tumor , ErbB Receptors/genetics , Interferon Type I/metabolism , Oncolytic Viruses/physiology , Vesicular stomatitis Indiana virus/genetics , Vesiculovirus/physiology
10.
J Proteome Res ; 19(10): 4046-4060, 2020 10 02.
Article in English | MEDLINE | ID: mdl-32866021

ABSTRACT

Adenosine-to-inosine RNA editing is an enzymatic post-transcriptional modification which modulates immunity and neural transmission in multicellular organisms. In particular, it involves editing of mRNA codons with the resulting amino acid substitutions. We identified such sites for developmental proteomes of Drosophila melanogaster at the protein level using available data for 15 stages of fruit fly development from egg to imago and 14 time points of embryogenesis. In total, 40 sites were obtained, each belonging to a unique protein, including four sites related to embryogenesis. The interactome analysis has revealed that the majority of the editing-recoded proteins were associated with synaptic vesicle trafficking and actomyosin organization. Quantitation data analysis suggested the existence of a phase-specific RNA editing regulation with yet unknown mechanisms. These findings supported the transcriptome analysis results, which showed that a burst in the RNA editing occurs during insect metamorphosis from pupa to imago. Finally, targeted proteomic analysis was performed to quantify editing-recoded and genomically encoded versions of five proteins in brains of larvae, pupae, and imago insects, which showed a clear tendency toward an increase in the editing rate for each of them. These results will allow a better understanding of the protein role in physiological effects of RNA editing.


Subject(s)
Drosophila Proteins , RNA Editing , Adenosine Deaminase/genetics , Adenosine Deaminase/metabolism , Animals , Drosophila Proteins/genetics , Drosophila Proteins/metabolism , Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Inosine/metabolism , Proteome/genetics , Proteome/metabolism , Proteomics , RNA, Messenger/genetics
11.
Cancer Cell Int ; 20: 488, 2020.
Article in English | MEDLINE | ID: mdl-33041669

ABSTRACT

INTRODUCTION: Cancers may be treated by selective targeting of the genes vital for their survival. A number of attempts have led to discovery of several genes essential for surviving of tumor cells of different types. In this work, we tried to analyze genes that were previously predicted to be essential for melanoma surviving. Here we present the results of transient siRNA-mediated knockdown of the four of such genes, namely, UNC45A, STK11IP, RHPN2 and ZNFX1, in melanoma cell line A375, then assayed the cells for their viability, proliferation and ability to migrate in vitro. In our study, the knockdown of the genes predicted as essential for melanoma survival does not lead to statistically significant changes in cell viability. On the other hand, for each of the studied genes, mobility assays showed that the knockdown of each of the target genes accelerates the speed of cells migrating. Possible explanation for such counterintuitive results may include insufficiency of the predicting computational models or the necessity of a multiplex knockdown of the genes. AIMS: To examine the hypothesis of essentiality of hypomutated genes for melanoma surviving we have performed knockdown of several genes in melanoma cell line and analyzed cell viability and their ability to migrate. METHODS: Knockdown was performed by siRNAs transfected by Metafectene PRO. The levels of mRNAs before and after knockdown were evaluated by RT-qPCR analysis. Cell viability and proliferation were assessed by MTT assay. Cell migration was assessed by wound healing assay. RESULTS: The knockdown of the genes predicted as essential for melanoma survival does not lead to statistically significant changes in cell viability. On the other hand, for each of the studied genes, mobility assays showed that the knockdown of each of the target genes accelerates the speed of cells migrating. CONCLUSION: Our results do not confirm initial hypothesis that the genes predicted essential for melanoma survival as a matter of fact support the survival of melanoma cells.

12.
Proteomics ; 19(23): e1900195, 2019 12.
Article in English | MEDLINE | ID: mdl-31576663

ABSTRACT

Proteogenomics is based on the use of customized genome or RNA sequencing databases for interrogation of shotgun proteomics data in search for proteome-level evidence of genome variations or RNA editing. In this work, the products of adenosine-to-inosine RNA editing in human and murine brain proteomes are identified using publicly available brain proteome LC-MS/MS datasets and an RNA editome database compiled from several sources. After filtering of false-positive results, 20 and 37 sites of editing in proteins belonging to 14 and 32 genes are identified for murine and human brain proteomes, respectively. Eight sites of editing identified with high spectral counts overlapped between human and mouse brain samples. Some of these sites have been previously reported using orthogonal methods, such as α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid (AMPA) glutamate receptors, CYFIP2, coatomer alpha. Also, differential editing between neurons and microglia is demonstrated in this work for some of the proteins from primary murine brain cell cultures. Because many edited sites are still not characterized functionally at the protein level, the results provide a necessary background for their further analysis in normal and diseased cells and tissues using targeted proteomic approaches.


Subject(s)
Adenosine/metabolism , Brain/metabolism , Inosine/metabolism , RNA Editing/genetics , Adaptor Proteins, Signal Transducing/metabolism , Animals , Cells, Cultured , Coatomer Protein/metabolism , Humans , Mice , Proteome/metabolism , Proteomics/methods
13.
Anal Chem ; 91(10): 6709-6715, 2019 05 21.
Article in English | MEDLINE | ID: mdl-31042365

ABSTRACT

Identification of isomeric amino acid residues in peptides and proteins is challenging but often highly desired in proteomics. One of the practically important cases that require isomeric assignments is that associated with single-nucleotide polymorphism substitutions of Met residues by Thr in cancer-related proteins. These genetically encoded substitutions can yet be confused with the chemical modifications, arising from protein alkylation by iodoacetamide, which is commonly used in the standard procedure of sample preparation for proteomic analysis. Similar to the genetically encoded mutations, the alkylation also induces a conversion of methionine residues, but to the iso-threonine form. Recognition of the mutations therefore requires isoform-sensitive detection techniques. Herein, we demonstrate an analytical method for reliable identification of isoforms of threonine residues in tryptic peptides. It is based on ultraviolet photodissociation mass spectrometry of cryogenically cooled ions and a machine-learning algorithm. The measured photodissociation mass spectra exhibit isoform-specific patterns, which are independent of the residues adjacent to threonine or iso-threonine in a peptide sequence. A comprehensive metric-based evaluation demonstrates that, being calibrated with a set of model peptides, the method allows for isomeric identification of threonine residues in peptides of arbitrary sequence.


Subject(s)
Mass Spectrometry/methods , Peptides/analysis , Threonine/analysis , Isomerism , Machine Learning , Peptides/chemistry , Peptides/radiation effects , Threonine/chemistry , Ultraviolet Rays
14.
J Proteome Res ; 17(5): 1801-1811, 2018 05 04.
Article in English | MEDLINE | ID: mdl-29619825

ABSTRACT

The identification of genetically encoded variants at the proteome level is an important problem in cancer proteogenomics. The generation of customized protein databases from DNA or RNA sequencing data is a crucial stage of the identification workflow. Genomic data filtering applied at this stage may significantly modify variant search results, yet its effect is generally left out of the scope of proteogenomic studies. In this work, we focused on this impact using data of exome sequencing and LC-MS/MS analyses of six replicates for eight melanoma cell lines processed by a proteogenomics workflow. The main objectives were identifying variant peptides and revealing the role of the genomic data filtering in the variant identification. A series of six confidence thresholds for single nucleotide polymorphisms and indels from the exome data were applied to generate customized sequence databases of different stringency. In the searches against unfiltered databases, between 100 and 160 variant peptides were identified for each of the cell lines using X!Tandem and MS-GF+ search engines. The recovery rate for variant peptides was ∼1%, which is approximately three times lower than that of the wild-type peptides. Using unfiltered genomic databases for variant searches resulted in higher sensitivity and selectivity of the proteogenomic workflow and positively affected the ability to distinguish the cell lines based on variant peptide signatures.


Subject(s)
Databases, Protein , Exome/genetics , Genetic Variation , Melanoma/pathology , Proteogenomics/methods , Animals , Cell Line, Tumor , Chromatography, Liquid , Humans , INDEL Mutation , Polymorphism, Single Nucleotide , Proteomics/methods , Search Engine , Tandem Mass Spectrometry
15.
J Proteome Res ; 17(11): 3889-3903, 2018 11 02.
Article in English | MEDLINE | ID: mdl-30298734

ABSTRACT

Adenosine-to-inosine RNA editing is one of the most common types of RNA editing, a posttranscriptional modification made by special enzymes. We present a proteomic study on this phenomenon for Drosophila melanogaster. Three proteome data sets were used in the study: two taken from public repository and the third one obtained here. A customized protein sequence database was generated using results of genome-wide adenosine-to-inosine RNA studies and applied for identifying the edited proteins. The total number of 68 edited peptides belonging to 59 proteins was identified in all data sets. Eight of them being shared between the whole insect, head, and brain proteomes. Seven edited sites belonging to synaptic vesicle and membrane trafficking proteins were selected for validation by orthogonal analysis by Multiple Reaction Monitoring. Five editing events in cpx, Syx1A, Cadps, CG4587, and EndoA were validated in fruit fly brain tissue at the proteome level using isotopically labeled standards. Ratios of unedited-to-edited proteoforms varied from 35:1 ( Syx1A) to 1:2 ( EndoA). Lys-137 to Glu editing of endophilin A may have functional consequences for its interaction to membrane. The work demonstrates the feasibility to identify the RNA editing event at the proteome level using shotgun proteomics and customized edited protein database.


Subject(s)
Adenosine/metabolism , Drosophila melanogaster/genetics , Inosine/metabolism , Insect Proteins/genetics , Proteogenomics/methods , RNA Editing , Acyltransferases/chemistry , Acyltransferases/genetics , Acyltransferases/metabolism , Adenosine Deaminase/genetics , Adenosine Deaminase/metabolism , Amino Acid Sequence , Animals , Base Sequence , Brain/metabolism , Databases, Protein , Datasets as Topic , Drosophila Proteins/chemistry , Drosophila Proteins/genetics , Drosophila Proteins/metabolism , Drosophila melanogaster/chemistry , Drosophila melanogaster/metabolism , Insect Proteins/classification , Insect Proteins/metabolism , Models, Molecular , Molecular Sequence Annotation , Proteome/genetics , Proteome/metabolism , Qa-SNARE Proteins/genetics , Qa-SNARE Proteins/metabolism , Synaptic Vesicles/chemistry , Synaptic Vesicles/metabolism
16.
Anal Bioanal Chem ; 410(16): 3827-3833, 2018 Jun.
Article in English | MEDLINE | ID: mdl-29663059

ABSTRACT

Recent advances in mass spectrometry and separation technologies created the opportunities for deep proteome characterization using shotgun proteomics approaches. The "real world" sample complexity and high concentration range limit the sensitivity of this characterization. The common strategy for increasing the sensitivity is sample fractionation prior to analysis either at the protein or the peptide level. Typically, fractionation at the peptide level is performed using linear gradient high-performance liquid chromatography followed by uniform fraction collection. However, this way of peptide fractionation results in significantly suboptimal operation of the mass spectrometer due to the non-uniform distribution of peptides between the fractions. In this work, we propose an approach based on peptide retention time prediction allowing optimization of chromatographic conditions and fraction collection procedures. An open-source software implementing the approach called FractionOptimizer was developed and is available at http://hg.theorchromo.ru/FractionOptimizer . The performance of the developed tool was demonstrated for human embryonic kidney (HEK293) cell line lysate. In these experiments, we improved the uniformity of the peptides distribution between fractions. Moreover, in addition to 13,492 peptides, we found 6787 new peptides not identified in the experiments without fractionation and up to 800 new proteins (or 25%). Graphical abstract The analysis workflow employing FractionOptimizer software.


Subject(s)
Chromatography, Reverse-Phase/methods , Peptides/analysis , Proteins/chemistry , Proteomics/methods , Chromatography, High Pressure Liquid/methods , HEK293 Cells , Humans , Proteome/chemistry , Software , Tandem Mass Spectrometry/methods
17.
J Proteome Res ; 16(5): 1936-1943, 2017 05 05.
Article in English | MEDLINE | ID: mdl-28317375

ABSTRACT

Proteogenomic studies aiming at identification of variant peptides using customized database searches of mass spectrometry data are facing a dilemma of selecting the most efficient database search strategy: A choice has to be made between using combined or sequential searches against reference (wild-type) and mutant protein databases or directly against the mutant database without the wild-type one. Here we called these approaches "all-together", "one-by-one", and "direct", respectively. We share the results of the comparison of these search strategies obtained for large data sets of publicly available proteogenomic data. On the basis of the results of this evaluation, we found that the "all-together" strategy provided, in general, more variant peptide identifications compared with the "one-by-one" approach, while showing similar performance for some specific cases. To validate further the results of this study, we performed a control comparison of the strategies in question using publicly available data for a mixture of the annotated human protein standard UPS1 and E. coli. For these data, both "all-together" and "one-by-one" approaches showed similar sensitivity and specificity of the searches, while the "direct" approach resulted in an increased number of false identifications.


Subject(s)
Databases, Protein , Proteogenomics/methods , Databases, Factual , Escherichia coli Proteins , Humans , Mass Spectrometry , Mutant Proteins , Peptides/genetics , Proteogenomics/standards , Sensitivity and Specificity
18.
Proteomics ; 16(14): 1980-91, 2016 07.
Article in English | MEDLINE | ID: mdl-27233776

ABSTRACT

Genomic and proteomic data were integrated into the proteogenomic workflow to identify coding genomic variants of Human Embryonic Kidney 293 (HEK-293) cell line at the proteome level. Shotgun proteome data published by Geiger et al. (2012), Chick et al. (2015), and obtained in this work for HEK-293 were searched against the customized genomic database generated using exome data published by Lin et al. (2014). Overall, 112 unique variants were identified at the proteome level out of ∼1200 coding variants annotated in the exome. Seven identified variants were shared between all the three considered proteomic datasets, and 27 variants were found in any two datasets. Some of the found variants belonged to widely known genomic polymorphisms originated from the germline, while the others were more likely resulting from somatic mutations. At least, eight of the proteins bearing amino acid variants were annotated as cancer-related ones, including p53 tumor suppressor. In all the considered shotgun datasets, the variant peptides were at the ratio of 1:2.5 less likely being identified than the wild-type ones compared with the corresponding theoretical peptides. This can be explained by the presence of the so-called "passenger" mutations in the genes, which were never expressed in HEK-293 cells. All MS data have been deposited in the ProteomeXchange with the dataset identifier PXD002613 (http://proteomecentral.proteomexchange.org/dataset/PXD002613).


Subject(s)
Exome , Neoplasm Proteins/isolation & purification , Polymorphism, Genetic , Proteome/isolation & purification , Proteomics/methods , Amino Acid Sequence , Datasets as Topic , Gene Ontology , HEK293 Cells , Humans , Molecular Sequence Annotation , Mutation , Neoplasm Proteins/genetics , Neoplasm Proteins/metabolism , Proteome/genetics , Proteome/metabolism , Tumor Suppressor Protein p53/genetics , Tumor Suppressor Protein p53/isolation & purification , Tumor Suppressor Protein p53/metabolism
19.
Proteomics ; 16(13): 1938-46, 2016 07.
Article in English | MEDLINE | ID: mdl-27193151

ABSTRACT

Twenty-nine human aqueous humor samples from patients with eye diseases such as cataract and glaucoma with and without pseudoexfoliation syndrome were characterized by LC-high resolution MS analysis. In total, 269 protein groups were identified with 1% false discovery rate including 32 groups that were not reported previously for this biological fluid. Since the samples were analyzed individually, but not pooled, 36 proteins were identified in all samples, comprising the constitutive proteome of the fluid. The most dominant molecular function of aqueous humor proteins as determined by GO analysis is endopeptidase inhibitor activity. Label-free protein quantification showed no significant difference between glaucoma and cataract aqueous humor proteomes. At the same time, we found decrease in the level of apolipoprotein D as a marker of the pseudoexfoliation syndrome. The data are available from ProteomeXchange repository (PXD002623).


Subject(s)
Aqueous Humor/chemistry , Cataract/diagnosis , Exfoliation Syndrome/diagnosis , Glaucoma/diagnosis , Proteome/analysis , Aged , Aged, 80 and over , Apolipoproteins D/analysis , Biomarkers/analysis , Chromatography, Liquid , Humans , Middle Aged , Tandem Mass Spectrometry
20.
J Proteome Res ; 15(11): 4039-4046, 2016 11 04.
Article in English | MEDLINE | ID: mdl-27457493

ABSTRACT

This work was aimed at estimating the concentrations of proteins encoded by human chromosome 18 (Chr 18) in plasma samples of 54 healthy male volunteers (aged 20-47). These young persons have been certified by the medical evaluation board as healthy subjects ready for space flight training. Over 260 stable isotope-labeled peptide standards (SIS) were synthesized to perform the measurements of proteins encoded by Chr 18. Selected reaction monitoring (SRM) with SIS allowed an estimate of the levels of 84 of 276 proteins encoded by Chr 18. These proteins were quantified in whole and depleted plasma samples. Concentration of the proteins detected varied from 10-6 M (transthyretin, P02766) to 10-11 M (P4-ATPase, O43861). A minor part of the proteins (mostly representing intracellular proteins) was characterized by extremely high inter individual variations. The results provide a background for studies of a potential biomarker in plasma among proteins encoded by Chr 18. The SRM raw data are available in ProteomeXchange repository (PXD004374).


Subject(s)
Astronauts , Chromosomes, Human, Pair 18 , Plasma/chemistry , Proteome/analysis , Adenosine Triphosphatases/analysis , Adult , Healthy Volunteers , Humans , Middle Aged , Prealbumin/analysis , Young Adult
SELECTION OF CITATIONS
SEARCH DETAIL