Validating Amino Acid Variants in Proteogenomics Using Sequence Coverage by Multiple Reads.

Levitsky, Lev I; Kuznetsova, Ksenia G; Kliuchnikova, Anna A; Ilina, Irina Y; Goncharov, Anton O; Lobas, Anna A; Ivanov, Mark V; Lazarev, Vassili N; Ziganshin, Rustam H; Gorshkov, Mikhail V; Moshkovskii, Sergei A

Levitsky, Lev I; Kuznetsova, Ksenia G; Kliuchnikova, Anna A; Ilina, Irina Y; Goncharov, Anton O; Lobas, Anna A; Ivanov, Mark V; Lazarev, Vassili N; Ziganshin, Rustam H; Gorshkov, Mikhail V; Moshkovskii, Sergei A.

Afiliación

Levitsky LI; V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38, bld. 2, Leninsky Prospect, Moscow 119334, Russia.
Kuznetsova KG; Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.
Kliuchnikova AA; Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.
Ilina IY; Pirogov Russian National Research Medical University, 1, Ostrovityanova, Moscow 117997, Russia.
Goncharov AO; Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.
Lobas AA; Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.
Ivanov MV; Pirogov Russian National Research Medical University, 1, Ostrovityanova, Moscow 117997, Russia.
Lazarev VN; V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38, bld. 2, Leninsky Prospect, Moscow 119334, Russia.
Ziganshin RH; V.L. Talrose Institute for Energy Problems of Chemical Physics, N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 38, bld. 2, Leninsky Prospect, Moscow 119334, Russia.
Gorshkov MV; Federal Research and Clinical Center of Physical-Chemical Medicine, 1a, Malaya Pirogovskaya, Moscow 119435, Russia.
Moshkovskii SA; Moscow Institute of Physics and Technology (State University), 9, Institutskiy per., Dolgoprudny, Moscow Region 141701, Russia.

J Proteome Res ; 21(6): 1438-1448, 2022 06 03.

Article en En | MEDLINE | ID: mdl-35536917

RESUMEN

Mass spectrometry-based proteome analysis implies matching the mass spectra of proteolytic peptides to amino acid sequences predicted from genomic sequences. Reliability of peptide variant identification in proteogenomic studies is often lacking. We propose a way to interpret shotgun proteomics results, specifically in the data-dependent acquisition mode, as protein sequence coverage by multiple reads as it is done in nucleic acid sequencing for calling of single nucleotide variants. Multiple reads for each sequence position could be provided by overlapping distinct peptides, thus confirming the presence of certain amino acid residues in the overlapping stretch with a lower false discovery rate. Overlapping distinct peptides originate from miscleaved tryptic peptides in combination with their properly cleaved counterparts and from peptides generated by multiple proteases after the same specimen is subject to parallel digestion and analyzed separately. We illustrate this approach using publicly available multiprotease data sets and our own data generated for the HEK-293 cell line digests obtained using trypsin, LysC, and GluC proteases. Totally, up to 30% of the whole proteome was covered by tryptic peptides with up to 7% covered twofold and more. The proteogenomic analysis of the HEK-293 cell line revealed 36 single amino acid variants, seven of which were supported by multiple reads.

Asunto(s)

Proteogenómica; Aminoácidos; Células HEK293; Humanos; Péptido Hidrolasas; Péptidos/análisis; Proteogenómica/métodos; Proteoma/análisis; Reproducibilidad de los Resultados

Palabras clave

SNP calling; data-dependent acquisition; false discovery rate; missense mutation; protease; proteogenomics; shotgun proteomics; single amino acid variant; single nucleotide variant

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Proteogenómica Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: J Proteome Res Asunto de la revista: BIOQUIMICA Año: 2022 Tipo del documento: Article País de afiliación: Rusia

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google