Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
J Appl Stat ; 50(8): 1725-1749, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37260475

RESUMEN

Motivated by an open problem of validating protein identities in label-free shotgun proteomics work-flows, we present a testing procedure to validate class (protein) labels using available measurements across N instances (peptides). More generally, we present a non-parametric solution to the problem of identifying instances that are deemed as outliers relative to the subset of instances assigned to the same class. The primary assumption is that measured distances between instances within the same class are stochastically smaller than measured distances between instances from different classes. We show that the overall type I error probability across all instances within a class can be controlled by some fixed value (say α). We also demonstrate conditions where similar results on type II error probability hold. The theoretical results are supplemented by an extensive numerical study illustrating the applicability and viability of our method. Even with up to 25% of instances initially mislabeled, our testing procedure maintains a high specificity and greatly reduces the proportion of mislabeled instances. The applicability and effectiveness of our testing procedure is further illustrated by a detailed example on a proteomics data set from children with sickle cell disease where five spike-in proteins acted as contrasting controls.

2.
Invest Ophthalmol Vis Sci ; 60(5): 1461-1469, 2019 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-30951588

RESUMEN

Purpose: To determine the effect of molecular weight (MW) on the concentration of plasma-derived proteins in aqueous humor and to estimate the plasma-derived and eye-derived fractions for each protein. Methods: Aqueous humor and plasma samples were obtained during cataract surgery on an institutional review board-approved protocol. Protein concentrations were determined by ELISA and quantitative antibody microarrays. A total of 93 proteins were studied, with most proteins analyzed using 27 to 116 aqueous and 6 to 30 plasma samples. Results: Plasma proteins without evidence of intraocular expression by sequence tags were used to fit a logarithmic model relating aqueous-plasma ratio (AH:PL) to MW. The log(AH:PL) appears to be well predicted by the log(MW) (P < 0.0001), with smaller proteins such as cystatin C (13 kDa) having a higher AH:PL (1:6) than larger proteins such as albumin (66 kDa, 1:300) and complement component 5 (188 kDa, 1:2500). The logarithmic model was used to calculate the eye-derived intraocular fraction (IOF) for each protein. Based on the IOF, 66 proteins could be categorized as plasma-derived (IOF<20), whereas 10 proteins were primarily derived from eye tissue (IOF >80), and 17 proteins had contribution from both plasma and eye tissue (IOF 20-80). Conclusions: Protein concentration of plasma-derived proteins in aqueous is nonlinearly dependent on MW in favor of smaller proteins. Our study demonstrates that for proper interpretation of results, proteomic studies evaluating changes in aqueous humor protein levels should take into account the plasma and eye-derived fractions.


Asunto(s)
Barrera Hematoacuosa/metabolismo , Catarata/metabolismo , Proteínas del Ojo/metabolismo , Anciano , Anciano de 80 o más Años , Ensayo de Inmunoadsorción Enzimática , Femenino , Humanos , Masculino , Persona de Mediana Edad , Peso Molecular , Proteómica/métodos
3.
BMC Bioinformatics ; 13 Suppl 16: S10, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-23176119

RESUMEN

Data visualization plays a critical role in interpreting experimental results of proteomic experiments. Heat maps are particularly useful for this task, as they allow us to find quantitative patterns across proteins and biological samples simultaneously. The quality of a heat map can be vastly improved by understanding the options available to display and organize the data in the heat map. This tutorial illustrates how to optimize heat maps for proteomics data by incorporating known characteristics of the data into the image. First, the concepts used to guide the creating of heat maps are demonstrated. Then, these concepts are applied to two types of analysis: visualizing spectral features across biological samples, and presenting the results of tests of statistical significance. For all examples we provide details of computer code in the open-source statistical programming language R, which can be used for biologists and clinicians with little statistical background. Heat maps are a useful tool for presenting quantitative proteomic data organized in a matrix format. Understanding and optimizing the parameters used to create the heat map can vastly improve both the appearance and the interoperation of heat map data.


Asunto(s)
Presentación de Datos , Espectrometría de Masas/estadística & datos numéricos , Proteínas/análisis , Proteómica/estadística & datos numéricos
4.
J Proteome Res ; 8(11): 5275-84, 2009 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-19891509

RESUMEN

The goal of many LC-MS proteomic investigations is to quantify and compare the abundance of proteins in complex biological mixtures. However, the output of an LC-MS experiment is not a list of proteins, but a list of quantified spectral features. To make protein-level conclusions, researchers typically apply ad hoc rules, or take an average of feature abundance to obtain a single protein-level quantity for each sample. We argue that these two approaches are inadequate. We discuss two statistical models, namely, fixed and mixed effects Analysis of Variance (ANOVA), which views individual features as replicate measurements of a protein's abundance, and explicitly account for this redundancy. We demonstrate, using a spike-in and a clinical data set, that the proposed models improve the sensitivity and specificity of testing, improve the accuracy of patient-specific protein quantifications, and are more robust in the presence of missing data.


Asunto(s)
Cromatografía Liquida/métodos , Espectrometría de Masas/métodos , Proteínas/análisis , Análisis de Varianza , Animales , Humanos , Modelos Estadísticos , Sensibilidad y Especificidad , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...