Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 77
Filtrar
1.
Cell ; 177(6): 1600-1618.e17, 2019 05 30.
Artigo em Inglês | MEDLINE | ID: mdl-31150625

RESUMO

Autism spectrum disorder (ASD) manifests as alterations in complex human behaviors including social communication and stereotypies. In addition to genetic risks, the gut microbiome differs between typically developing (TD) and ASD individuals, though it remains unclear whether the microbiome contributes to symptoms. We transplanted gut microbiota from human donors with ASD or TD controls into germ-free mice and reveal that colonization with ASD microbiota is sufficient to induce hallmark autistic behaviors. The brains of mice colonized with ASD microbiota display alternative splicing of ASD-relevant genes. Microbiome and metabolome profiles of mice harboring human microbiota predict that specific bacterial taxa and their metabolites modulate ASD behaviors. Indeed, treatment of an ASD mouse model with candidate microbial metabolites improves behavioral abnormalities and modulates neuronal excitability in the brain. We propose that the gut microbiota regulates behaviors in mice via production of neuroactive metabolites, suggesting that gut-brain connections contribute to the pathophysiology of ASD.


Assuntos
Transtorno do Espectro Autista/microbiologia , Sintomas Comportamentais/microbiologia , Microbioma Gastrointestinal/fisiologia , Animais , Transtorno do Espectro Autista/metabolismo , Transtorno do Espectro Autista/fisiopatologia , Bactérias , Comportamento Animal/fisiologia , Encéfalo/metabolismo , Modelos Animais de Doenças , Humanos , Camundongos , Microbiota , Fatores de Risco
2.
Nat Chem Biol ; 2024 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-38302607

RESUMO

The leaf-cutter ant fungal garden ecosystem is a naturally evolved model system for efficient plant biomass degradation. Degradation processes mediated by the symbiotic fungus Leucoagaricus gongylophorus are difficult to characterize due to dynamic metabolisms and spatial complexity of the system. Herein, we performed microscale imaging across 12-µm-thick adjacent sections of Atta cephalotes fungal gardens and applied a metabolome-informed proteome imaging approach to map lignin degradation. This approach combines two spatial multiomics mass spectrometry modalities that enabled us to visualize colocalized metabolites and proteins across and through the fungal garden. Spatially profiled metabolites revealed an accumulation of lignin-related products, outlining morphologically unique lignin microhabitats. Metaproteomic analyses of these microhabitats revealed carbohydrate-degrading enzymes, indicating a prominent fungal role in lignocellulose decomposition. Integration of metabolome-informed proteome imaging data provides a comprehensive view of underlying biological pathways to inform our understanding of metabolic fungal pathways in plant matter degradation within the micrometer-scale environment.

3.
Mol Cell Proteomics ; 22(2): 100491, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36603806

RESUMO

Conventional proteomic approaches measure the averaged signal from mixed cell populations or bulk tissues, leading to the dilution of signals arising from subpopulations of cells that might serve as important biomarkers. Recent developments in bottom-up proteomics have enabled spatial mapping of cellular heterogeneity in tissue microenvironments. However, bottom-up proteomics cannot unambiguously define and quantify proteoforms, which are intact (i.e., functional) forms of proteins capturing genetic variations, alternatively spliced transcripts and posttranslational modifications. Herein, we described a spatially resolved top-down proteomics (TDP) platform for proteoform identification and quantitation directly from tissue sections. The spatial TDP platform consisted of a nanodroplet processing in one pot for trace samples-based sample preparation system and an laser capture microdissection-based cell isolation system. We improved the nanodroplet processing in one pot for trace samples sample preparation by adding benzonase in the extraction buffer to enhance the coverage of nucleus proteins. Using ∼200 cultured cells as test samples, this approach increased total proteoform identifications from 493 to 700; with newly identified proteoforms primarily corresponding to nuclear proteins. To demonstrate the spatial TDP platform in tissue samples, we analyzed laser capture microdissection-isolated tissue voxels from rat brain cortex and hypothalamus regions. We quantified 509 proteoforms within the union of top-down mass spectrometry-based proteoform identification and characterization and TDPortal identifications to match with features from protein mass extractor. Several proteoforms corresponding to the same gene exhibited mixed abundance profiles between two tissue regions, suggesting potential posttranslational modification-specific spatial distributions. The spatial TDP workflow has prospects for biomarker discovery at proteoform level from small tissue sections.


Assuntos
Proteoma , Proteômica , Proteoma/metabolismo , Microfluídica , Espectrometria de Massas , Proteínas de Ligação a DNA
4.
Proteomics ; : e2400025, 2024 Jun 19.
Artigo em Inglês | MEDLINE | ID: mdl-38895962

RESUMO

Extracellular vesicles (EVs) carry diverse biomolecules derived from their parental cells, making their components excellent biomarker candidates. However, purifying EVs is a major hurdle in biomarker discovery since current methods require large amounts of samples, are time-consuming and typically have poor reproducibility. Here we describe a simple, fast, and sensitive EV fractionation method using size exclusion chromatography (SEC) on a fast protein liquid chromatography (FPLC) system. Our method uses a Superose 6 Increase 5/150, which has a bed volume of 2.9 mL. The FPLC system and small column size enable reproducible separation of only 50 µL of human plasma in 15 min. To demonstrate the utility of our method, we used longitudinal samples from a group of individuals who underwent intense exercise. A total of 838 proteins were identified, of which, 261 were previously characterized as EV proteins, including classical markers, such as cluster of differentiation (CD)9 and CD81. Quantitative analysis showed low technical variability with correlation coefficients greater than 0.9 between replicates. The analysis captured differences in relevant EV proteins involved in response to physical activity. Our method enables fast and sensitive fractionation of plasma EVs with low variability, which will facilitate biomarker studies in large clinical cohorts.

5.
J Proteome Res ; 2024 Feb 29.
Artigo em Inglês | MEDLINE | ID: mdl-38421884

RESUMO

Proteoforms, the different forms of a protein with sequence variations including post-translational modifications (PTMs), execute vital functions in biological systems, such as cell signaling and epigenetic regulation. Advances in top-down mass spectrometry (MS) technology have permitted the direct characterization of intact proteoforms and their exact number of modification sites, allowing for the relative quantification of positional isomers (PI). Protein positional isomers refer to a set of proteoforms with identical total mass and set of modifications, but varying PTM site combinations. The relative abundance of PI can be estimated by matching proteoform-specific fragment ions to top-down tandem MS (MS2) data to localize and quantify modifications. However, the current approaches heavily rely on manual annotation. Here, we present IsoForma, an open-source R package for the relative quantification of PI within a single tool. Benchmarking IsoForma's performance against two existing workflows produced comparable results and improvements in speed. Overall, IsoForma provides a streamlined process for quantifying PI, reduces the analysis time, and offers an essential framework for developing customized proteoform analysis workflows. The software is open source and available at https://github.com/EMSL-Computing/isoforma-lib.

6.
Int J Mol Sci ; 25(8)2024 Apr 13.
Artigo em Inglês | MEDLINE | ID: mdl-38673911

RESUMO

One of the most significant challenges in human health risk assessment is to evaluate hazards from exposure to environmental chemical mixtures. Polycyclic aromatic hydrocarbons (PAHs) are a class of ubiquitous contaminants typically found as mixtures in gaseous and particulate phases in ambient air pollution associated with petrochemicals from Superfund sites and the burning of fossil fuels. However, little is understood about how PAHs in mixtures contribute to toxicity in lung cells. To investigate mixture interactions and component additivity from environmentally relevant PAHs, two synthetic mixtures were created from PAHs identified in passive air samplers at a legacy creosote site impacted by wildfires. The primary human bronchial epithelial cells differentiated at the air-liquid interface were treated with PAH mixtures at environmentally relevant proportions and evaluated for the differential expression of transcriptional biomarkers related to xenobiotic metabolism, oxidative stress response, barrier integrity, and DNA damage response. Component additivity was evaluated across all endpoints using two independent action (IA) models with and without the scaling of components by toxic equivalence factors. Both IA models exhibited trends that were unlike the observed mixture response and generally underestimated the toxicity across dose suggesting the potential for non-additive interactions of components. Overall, this study provides an example of the usefulness of mixture toxicity assessment with the currently available methods while demonstrating the need for more complex yet interpretable mixture response evaluation methods for environmental samples.


Assuntos
Células Epiteliais , Hidrocarbonetos Policíclicos Aromáticos , Humanos , Hidrocarbonetos Policíclicos Aromáticos/toxicidade , Hidrocarbonetos Policíclicos Aromáticos/metabolismo , Células Epiteliais/metabolismo , Células Epiteliais/efeitos dos fármacos , Estresse Oxidativo/efeitos dos fármacos , Dano ao DNA/efeitos dos fármacos , Modelos Biológicos , Poluentes Atmosféricos/toxicidade , Células Cultivadas , Brônquios/metabolismo , Brônquios/citologia , Brônquios/efeitos dos fármacos , Biomarcadores
7.
J Proteome Res ; 22(2): 570-576, 2023 02 03.
Artigo em Inglês | MEDLINE | ID: mdl-36622218

RESUMO

The pmartR (https://github.com/pmartR/pmartR) package was designed for the quality control (QC) and analysis of mass spectrometry data, tailored to specific characteristics of proteomic (isobaric or labeled), metabolomic, and lipidomic data sets. Since its initial release, the tool has been expanded to address the needs of its growing userbase and now includes QC and statistics for nuclear magnetic resonance metabolomic data, and leverages the DESeq2, edgeR, and limma-voom R packages for transcriptomic data analyses. These improvements have made progress toward a unified omics processing pipeline for ease of reporting and streamlined statistical purposes. The package's statistics and visualization capabilities have also been expanded by adding support for paired data and by integrating pmartR with the trelliscopejs R package for the quick creation of trellis displays (https://github.com/hafen/trelliscopejs). Here, we present relevant examples of each of these enhancements to pmartR and highlight how each new feature benefits the omics community.


Assuntos
Proteômica , Software , Proteômica/métodos , Metabolômica/métodos , Perfilação da Expressão Gênica/métodos , Controle de Qualidade
8.
J Proteome Res ; 2023 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-38085827

RESUMO

PMart is a web-based tool for reproducible quality control, exploratory data analysis, statistical analysis, and interactive visualization of 'omics data, based on the functionality of the pmartR R package. The newly improved user interface supports more 'omics data types, additional statistical capabilities, and enhanced options for creating downloadable graphics. PMart supports the analysis of label-free and isobaric-labeled (e.g., TMT, iTRAQ) proteomics, nuclear magnetic resonance (NMR) and mass-spectrometry (MS)-based metabolomics, MS-based lipidomics, and ribonucleic acid sequencing (RNA-seq) transcriptomics data. At the end of a PMart session, a report is available that summarizes the processing steps performed and includes the pmartR R package functions used to execute the data processing. In addition, built-in safeguards in the backend code prevent users from utilizing methods that are inappropriate based on omics data type. PMart is a user-friendly interface for conducting exploratory data analysis and statistical comparisons of omics data without programming.

9.
Anal Chem ; 95(19): 7536-7544, 2023 05 16.
Artigo em Inglês | MEDLINE | ID: mdl-37129113

RESUMO

As metabolomics grows into a high-throughput and high demand research field, current metrics for the identification of small molecules in gas chromatography-mass spectrometry (GC-MS) still require manual verification. Though steps have been taken to improve scoring metrics by combining spectral similarity (SS) and retention index (RI), the problem persists. A large body of literature has analyzed and refined SS scores, but few studies have explicitly studied improvements to RI scores. Here, we examined whether uninvestigated assumptions of the RI score are valid and propose ways to improve them. Query RIs were matched to library RI with a generous window of ±35 to avoid unintentional removal of valid compound identifications. Each match was manually verified as a true positive (TP), true negative, or unknown. Metabolites with at least 30 TP identifications were included in downstream analyses, resulting in a total of 87 metabolites from samples of varying complexity and type (e.g., amino acid mixtures, human urine, fungal species, and so on.). Our results showed that the RI score assumptions of normality, consistent variance across metabolites, and a mean error centered at 0 are often violated. We demonstrated through a cross-validation analysis that modifying these underlying assumptions according to empirical metabolite-specific distributions improved the TP and negative rankings. Further, we statistically determined the minimum number of samples required to estimate distributional parameters for scoring metrics. Overall, this work proposes a robust statistical pipeline to reduce the time bottleneck of metabolite identification by improving RI scores and thus minimize the effort to complete manual verification.


Assuntos
Metabolômica , Humanos , Cromatografia Gasosa-Espectrometria de Massas/métodos , Metabolômica/métodos
10.
Anal Chem ; 95(33): 12195-12199, 2023 08 22.
Artigo em Inglês | MEDLINE | ID: mdl-37551970

RESUMO

Mass spectrometry is a powerful tool for identifying and analyzing biomolecules such as metabolites and lipids in complex biological samples. Liquid chromatography and gas chromatography mass spectrometry studies quite commonly involve large numbers of samples, which can require significant time for sample preparation and analyses. To accommodate such studies, the samples are commonly split into batches. Inevitably, variations in sample handling, temperature fluctuation, imprecise timing, column degradation, and other factors result in systematic errors or biases of the measured abundances between the batches. Numerous methods are available via R packages to assist with batch correction for omics data; however, since these methods were developed by different research teams, the algorithms are available in separate R packages, each with different data input and output formats. We introduce the malbacR package, which consolidates 11 common batch effect correction methods for omics data into one place so users can easily implement and compare the following: pareto scaling, power scaling, range scaling, ComBat, EigenMS, NOMIS, RUV-random, QC-RLSC, WaveICA2.0, TIGER, and SERRF. The malbacR package standardizes data input and output formats across these batch correction methods. The package works in conjunction with the pmartR package, allowing users to seamlessly include the batch effect correction in a pmartR workflow without needing any additional data manipulation.


Assuntos
Algoritmos , Projetos de Pesquisa , Cromatografia Líquida/métodos , Espectrometria de Massas/métodos , Cromatografia Gasosa-Espectrometria de Massas
11.
Am J Respir Crit Care Med ; 205(2): 208-218, 2022 01 15.
Artigo em Inglês | MEDLINE | ID: mdl-34752721

RESUMO

Rationale: The current understanding of human lung development derives mostly from animal studies. Although transcript-level studies have analyzed human donor tissue to identify genes expressed during normal human lung development, protein-level analysis that would enable the generation of new hypotheses on the processes involved in pulmonary development are lacking. Objectives: To define the temporal dynamic of protein expression during human lung development. Methods: We performed proteomics analysis of human lungs at 10 distinct times from birth to 8 years to identify the molecular networks mediating postnatal lung maturation. Measurements and Main Results: We identified 8,938 proteins providing a comprehensive view of the developing human lung proteome. The analysis of the data supports the existence of distinct molecular substages of alveolar development and predicted the age of independent human lung samples, and extensive remodeling of the lung proteome occurred during postnatal development. Evidence of post-transcriptional control was identified in early postnatal development. An extensive extracellular matrix remodeling was supported by changes in the proteome during alveologenesis. The concept of maturation of the immune system as an inherent part of normal lung development was substantiated by flow cytometry and transcriptomics. Conclusions: This study provides the first in-depth characterization of the human lung proteome during development, providing a unique proteomic resource freely accessible at Lungmap.net. The data support the extensive remodeling of the lung proteome during development, the existence of molecular substages of alveologenesis, and evidence of post-transcriptional control in early postnatal development.


Assuntos
Regulação da Expressão Gênica no Desenvolvimento/fisiologia , Pulmão/crescimento & desenvolvimento , Pulmão/metabolismo , Proteínas/genética , Proteínas/metabolismo , Alvéolos Pulmonares/crescimento & desenvolvimento , Alvéolos Pulmonares/metabolismo , Criança , Pré-Escolar , Feminino , Humanos , Lactente , Recém-Nascido , Masculino , Proteômica
12.
J Proteome Res ; 21(4): 891-898, 2022 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-35220718

RESUMO

Bottom-up proteomics provides peptide measurements and has been invaluable for moving proteomics into large-scale analyses. Commonly, a single quantitative value is reported for each protein-coding gene by aggregating peptide quantities into protein groups following protein inference or parsimony. However, given the complexity of both RNA splicing and post-translational protein modification, it is overly simplistic to assume that all peptides that map to a singular protein-coding gene will demonstrate the same quantitative response. By assuming that all peptides from a protein-coding sequence are representative of the same protein, we may miss the discovery of important biological differences. To capture the contributions of existing proteoforms, we need to reconsider the practice of aggregating protein values to a single quantity per protein-coding gene.


Assuntos
Proteínas , Proteômica , Peptídeos/genética , Peptídeos/metabolismo , Processamento de Proteína Pós-Traducional , Proteínas/metabolismo , Proteoma/genética , Proteoma/metabolismo
13.
J Biol Chem ; 296: 100340, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33515546

RESUMO

The lipid composition of HIV-1 virions is enriched in sphingomyelin (SM), but the roles that SM or other sphingolipids (SLs) might play in the HIV-1 replication pathway have not been elucidated. In human cells, SL levels are regulated by ceramide synthase (CerS) enzymes that produce ceramides, which can be converted to SMs, hexosylceramides, and other SLs. In many cell types, CerS2, which catalyzes the synthesis of very long chain ceramides, is the major CerS. We have examined how CerS2 deficiency affects the assembly and infectivity of HIV-1. As expected, we observed that very long chain ceramide, hexosylceramide, and SM were reduced in CerS2 knockout cells. CerS2 deficiency did not affect HIV-1 assembly or the incorporation of the HIV-1 envelope (Env) protein into virus particles, but it reduced the infectivites of viruses produced in the CerS2-deficient cells. The reduced viral infection levels were dependent on HIV-1 Env, since HIV-1 particles that were pseudotyped with the vesicular stomatitis virus glycoprotein did not exhibit reductions in infectivity. Moreover, cell-cell fusion assays demonstrated that the functional defect of HIV-1 Env in CerS2-deficient cells was independent of other viral proteins. Overall, our results indicate that the altered lipid composition of CerS2-deficient cells specifically inhibit the HIV-1 Env receptor binding and/or fusion processes.


Assuntos
Deleção de Genes , Infecções por HIV/genética , HIV-1/fisiologia , Proteínas de Membrana/genética , Esfingosina N-Aciltransferase/genética , Proteínas Supressoras de Tumor/genética , Ceramidas/genética , Ceramidas/metabolismo , Células HEK293 , Infecções por HIV/metabolismo , Humanos , Proteínas de Membrana/metabolismo , Esfingosina N-Aciltransferase/metabolismo , Proteínas Supressoras de Tumor/metabolismo , Internalização do Vírus
14.
BMC Bioinformatics ; 22(1): 287, 2021 May 29.
Artigo em Inglês | MEDLINE | ID: mdl-34051754

RESUMO

BACKGROUND: Representing biological networks as graphs is a powerful approach to reveal underlying patterns, signatures, and critical components from high-throughput biomolecular data. However, graphs do not natively capture the multi-way relationships present among genes and proteins in biological systems. Hypergraphs are generalizations of graphs that naturally model multi-way relationships and have shown promise in modeling systems such as protein complexes and metabolic reactions. In this paper we seek to understand how hypergraphs can more faithfully identify, and potentially predict, important genes based on complex relationships inferred from genomic expression data sets. RESULTS: We compiled a novel data set of transcriptional host response to pathogenic viral infections and formulated relationships between genes as a hypergraph where hyperedges represent significantly perturbed genes, and vertices represent individual biological samples with specific experimental conditions. We find that hypergraph betweenness centrality is a superior method for identification of genes important to viral response when compared with graph centrality. CONCLUSIONS: Our results demonstrate the utility of using hypergraphs to represent complex biological systems and highlight central important responses in common to a variety of highly pathogenic viruses.


Assuntos
Algoritmos , Modelos Biológicos , Genômica , Proteínas
15.
J Proteome Res ; 20(1): 1-13, 2021 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-32929967

RESUMO

The throughput efficiency and increased depth of coverage provided by isobaric-labeled proteomics measurements have led to increased usage of these techniques. However, the structure of missing data is different than unlabeled studies, which prompts the need for this review to compare the efficacy of nine imputation methods on large isobaric-labeled proteomics data sets to guide researchers on the appropriateness of various imputation methods. Imputation methods were evaluated by accuracy, statistical hypothesis test inference, and run time. In general, expectation maximization and random forest imputation methods yielded the best performance, and constant-based methods consistently performed poorly across all data set sizes and percentages of missing values. For data sets with small sample sizes and higher percentages of missing data, results indicate that statistical inference with no imputation may be preferable. On the basis of the findings in this review, there are core imputation methods that perform better for isobaric-labeled proteomics data, but great care and consideration as to whether imputation is the optimal strategy should be given for data sets comprised of a small number of samples.


Assuntos
Algoritmos , Proteômica
16.
J Proteome Res ; 20(4): 2014-2020, 2021 04 02.
Artigo em Inglês | MEDLINE | ID: mdl-33661636

RESUMO

Visual examination of mass spectrometry data is necessary to assess data quality and to facilitate data exploration. Graphics provide the means to evaluate spectral properties, test alternative peptide/protein sequence matches, prepare annotated spectra for publication, and fine-tune parameters during wet lab procedures. Visual inspection of LC-MS data is constrained by proteomics visualization software designed for particular workflows or vendor-specific tools without open-source code. We built PSpecteR, an open-source and interactive R Shiny web application for visualization of LC-MS data, with support for several steps of proteomics data processing, including reading various mass spectrometry files, running open-source database search engines, labeling spectra with fragmentation patterns, testing post-translational modifications, plotting where identified fragments map to reference sequences, and visualizing algorithmic output and metadata. All figures, tables, and spectra are exportable within one easy-to-use graphical user interface. Our current software provides a flexible and modern R framework to support fast implementation of additional features. The open-source code is readily available (https://github.com/EMSL-Computing/PSpecteR), and a PSpecteR Docker container (https://hub.docker.com/r/emslcomputing/pspecter) is available for easy local installation.


Assuntos
Proteômica , Espectrometria de Massas em Tandem , Cromatografia Líquida , Proteínas , Software
17.
PLoS Comput Biol ; 16(3): e1007654, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-32176690

RESUMO

The high-resolution and mass accuracy of Fourier transform mass spectrometry (FT-MS) has made it an increasingly popular technique for discerning the composition of soil, plant and aquatic samples containing complex mixtures of proteins, carbohydrates, lipids, lignins, hydrocarbons, phytochemicals and other compounds. Thus, there is a growing demand for informatics tools to analyze FT-MS data that will aid investigators seeking to understand the availability of carbon compounds to biotic and abiotic oxidation and to compare fundamental chemical properties of complex samples across groups. We present ftmsRanalysis, an R package which provides an extensive collection of data formatting and processing, filtering, visualization, and sample and group comparison functionalities. The package provides a suite of plotting methods and enables expedient, flexible and interactive visualization of complex datasets through functions which link to a powerful and interactive visualization user interface, Trelliscope. Example analysis using FT-MS data from a soil microbiology study demonstrates the core functionality of the package and highlights the capabilities for producing interactive visualizations.


Assuntos
Biologia Computacional/métodos , Análise de Fourier , Espectrometria de Massas , Software , Bases de Dados Factuais , Microbiologia do Solo
18.
Rapid Commun Mass Spectrom ; 35(9): e9068, 2021 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-33590907

RESUMO

RATIONALE: Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS) is a preferred technique for analyzing complex organic mixtures. Currently, there is no consensus normalization approach, nor an objective method for selecting one, for quantitative analyses of FT-ICR-MS data. We investigate a method to evaluate and score the amount of bias various normalization approaches introduce into the data. METHODS: We evaluate the ability of the Statistical Procedure for the Analysis of Normalization Strategies (SPANS) to guide the selection of appropriate normalization approaches for two different FT-ICR-MS data sets. Furthermore, we test the robustness of SPANS results to changes in SPANS parameter values and assess the impact of using various normalization approaches on downstream statistical analyses. RESULTS: The normalization approach identified by SPANS differed for the two data sets. Normalization methods impacted the statistical significance of peaks differently, underscoring the importance of carefully evaluating potential methods. More consistent SPANS scores resulted when at least 120 significant peaks are used, where larger sets of peaks were obtained by increasing the p-value threshold. Interestingly, we show that total sum scaling and highest peak normalization, used in previous studies, underperformed relative to SPANS-recommended normalization approaches. CONCLUSIONS: Although there is no single, best normalization method for all data sets, SPANS provides a mechanism to identify an appropriate normalization method for analyzing FT-ICR-MS data quantitatively. The number of peaks used in the background distributions of SPANS contributes more significantly to the reproducibility of results than the p-value thresholds used to obtain those peaks.

19.
Anal Chem ; 92(2): 1796-1803, 2020 01 21.
Artigo em Inglês | MEDLINE | ID: mdl-31742994

RESUMO

Advancements in molecular separations coupled with mass spectrometry have enabled metabolome analyses for clinical cohorts. A population of interest for metabolome profiling is patients with rare disease for which abnormal metabolic signatures may yield clues into the genetic basis, as well as mechanistic drivers of the disease and possible treatment options. We undertook the metabolome profiling of a large cohort of patients with mysterious conditions characterized through the Undiagnosed Diseases Network (UDN). Due to the size and enrollment procedures, collection of the metabolomes for UDN patients took place over 2 years. We describe the study designed to adjust for measurements collected over a long time scale and how this enabled statistical analyses to summarize the metabolome of individual patients. We demonstrate the removal of time-based batch effects, overall statistical characteristics of the UDN population, and two case studies of interest that demonstrate the utility of metabolome profiling for rare diseases.


Assuntos
Lipídeos/análise , Modelos Estatísticos , Doenças não Diagnosticadas/diagnóstico , Estudos de Coortes , Humanos , Metabolômica , Doenças não Diagnosticadas/metabolismo
20.
Mol Cell Proteomics ; 17(9): 1824-1836, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-29666158

RESUMO

Liquid chromatography-mass spectrometry (LC-MS)-based proteomics studies of large sample cohorts can easily require from months to years to complete. Acquiring consistent, high-quality data in such large-scale studies is challenging because of normal variations in instrumentation performance over time, as well as artifacts introduced by the samples themselves, such as those because of collection, storage and processing. Existing quality control methods for proteomics data primarily focus on post-hoc analysis to remove low-quality data that would degrade downstream statistics; they are not designed to evaluate the data in near real-time, which would allow for interventions as soon as deviations in data quality are detected. In addition to flagging analyses that demonstrate outlier behavior, evaluating how the data structure changes over time can aide in understanding typical instrument performance or identify issues such as a degradation in data quality because of the need for instrument cleaning and/or re-calibration. To address this gap for proteomics, we developed Quality Control Analysis in Real-Time (QC-ART), a tool for evaluating data as they are acquired to dynamically flag potential issues with instrument performance or sample quality. QC-ART has similar accuracy as standard post-hoc analysis methods with the additional benefit of real-time analysis. We demonstrate the utility and performance of QC-ART in identifying deviations in data quality because of both instrument and sample issues in near real-time for LC-MS-based plasma proteomics analyses of a sample subset of The Environmental Determinants of Diabetes in the Young cohort. We also present a case where QC-ART facilitated the identification of oxidative modifications, which are often underappreciated in proteomic experiments.


Assuntos
Sistemas Computacionais , Proteômica/métodos , Proteômica/normas , Controle de Qualidade , Espectrometria de Massas em Tandem/métodos , Algoritmos , Estudos de Coortes , Bases de Dados de Proteínas , Humanos , Marcação por Isótopo , Oxirredução , Peptídeos/metabolismo , Curva ROC , Interface Usuário-Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA