Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
1.
Microbiol Spectr ; 12(3): e0345623, 2024 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-38294221

RESUMO

To infer the biological meaning from transcriptome data, it is useful to focus on genes that are regulated by the same regulator, i.e., regulons. Unfortunately, current gene set enrichment analysis (GSEA) tools do not consider whether a gene is activated or repressed by a regulator. This distinction is crucial when analyzing regulons since a regulator can work as an activator of certain genes and as a repressor of other genes, yet both sets of genes belong to the same regulon. Therefore, simply averaging expression differences of the genes of such a regulon will not properly reflect the activity of the regulator. What makes it more complicated is the fact that many genes are regulated by different transcription factors, and current transcriptome analysis tools are unable to indicate which regulator is most likely responsible for the observed expression difference of a gene. To address these challenges, we developed the gene set enrichment analysis program GINtool. Additional features of GINtool are novel graphical representations to facilitate the visualization of gene set analyses of transcriptome data, the possibility to include functional categories as gene sets for analysis, and the option to analyze expression differences within operons, which is useful when analyzing prokaryotic transcriptome and also proteome data.IMPORTANCEMeasuring the activity of all genes in cells is a common way to elucidate the function and regulation of genes. These transcriptome analyses produce large amounts of data since genomes contain thousands of genes. The analysis of these large data sets is challenging. Therefore, we developed a new software tool called GINtool that can facilitate the analysis of transcriptome data by using prior knowledge of gene sets controlled by the same regulator, the so-called regulons. An important novelty of GINtool is that it can take into account the directionality of gene regulation in these analyses, i.e., whether a gene is activated or repressed, which is crucial to assess whether a regulon or functional category is affected. GINtool also includes new graphical methods to facilitate the visual inspection of regulation events in transcriptome data sets. These and additional analysis methods included in GINtool make it a powerful software tool to analyze transcriptome data.


Assuntos
Perfilação da Expressão Gênica , Transcriptoma , Fatores de Transcrição , Software , Óperon , Regulação Bacteriana da Expressão Gênica
2.
Microb Cell Fact ; 22(1): 231, 2023 Nov 09.
Artigo em Inglês | MEDLINE | ID: mdl-37946188

RESUMO

BACKGROUND: The bacterium Bacillus subtilis is extensively used for the commercial production of enzymes due to its efficient protein secretion capacity. However, the efficiency of secretion varies greatly between enzymes, and despite many years of research, optimization of enzyme production is still largely a matter of trial-and-error. Genome-wide transcriptome analysis seems a useful tool to identify relevant secretion bottlenecks, yet to this day, only a limited number of transcriptome studies have been published that focus on enzyme secretion in B. subtilis. Here, we examined the effect of high-level expression of the commercially important enzyme endo-1,4-ß-xylanase XynA on the B. subtilis transcriptome using RNA-seq. RESULTS: Using the novel gene-set analysis tool GINtool, we found a reduced activity of the CtsR regulon when XynA was overproduced. This regulon comprises several protein chaperone genes, including clpC, clpE and clpX, and is controlled by transcriptional repression. CtsR levels are directly controlled by regulated proteolysis, involving ClpC and its cognate protease ClpP. When we abolished this negative feedback, by inactivating the repressor CtsR, the XynA production increased by 25%. CONCLUSIONS: Overproduction of enzymes can reduce the pool of Clp protein chaperones in B. subtilis, presumably due to negative feedback regulation. Breaking this feedback can improve enzyme production yields. Considering the conserved nature of Clp chaperones and their regulation, this method might benefit high-yield enzyme production in other organisms.


Assuntos
Bacillus subtilis , Proteínas de Choque Térmico , Proteínas de Choque Térmico/genética , Bacillus subtilis/metabolismo , Regulon , Proteínas Repressoras/metabolismo , Adenosina Trifosfatases/genética , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Regulação Bacteriana da Expressão Gênica
3.
BMC Plant Biol ; 21(1): 315, 2021 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-34215189

RESUMO

BACKGROUND: Plant-produced specialised metabolites are a powerful part of a plant's first line of defence against herbivorous insects, bacteria and fungi. Wild ancestors of present-day cultivated tomato produce a plethora of acylsugars in their type-I/IV trichomes and volatiles in their type-VI trichomes that have a potential role in plant resistance against insects. However, metabolic profiles are often complex mixtures making identification of the functionally interesting metabolites challenging. Here, we aimed to identify specialised metabolites from a wide range of wild tomato genotypes that could explain resistance to vector insects whitefly (Bemisia tabaci) and Western flower thrips (Frankliniella occidentalis). We evaluated plant resistance, determined trichome density and obtained metabolite profiles of the glandular trichomes by LC-MS (acylsugars) and GC-MS (volatiles). Using a customised Random Forest learning algorithm, we determined the contribution of specific specialised metabolites to the resistance phenotypes observed. RESULTS: The selected wild tomato accessions showed different levels of resistance to both whiteflies and thrips. Accessions resistant to one insect can be susceptible to another. Glandular trichome density is not necessarily a good predictor for plant resistance although the density of type-I/IV trichomes, related to the production of acylsugars, appears to correlate with whitefly resistance. For type VI-trichomes, however, it seems resistance is determined by the specific content of the glands. There is a strong qualitative and quantitative variation in the metabolite profiles between different accessions, even when they are from the same species. Out of 76 acylsugars found, the random forest algorithm linked two acylsugars (S3:15 and S3:21) to whitefly resistance, but none to thrips resistance. Out of 86 volatiles detected, the sesquiterpene α-humulene was linked to whitefly susceptible accessions instead. The algorithm did not link any specific metabolite to resistance against thrips, but monoterpenes α-phellandrene, α-terpinene and ß-phellandrene/D-limonene were significantly associated with susceptible tomato accessions. CONCLUSIONS: Whiteflies and thrips are distinctly targeted by certain specialised metabolites found in wild tomatoes. The machine learning approach presented helped to identify features with efficacy toward the insect species studied. These acylsugar metabolites can be targets for breeding efforts towards the selection of insect-resistant cultivars.


Assuntos
Resistência à Doença/genética , Variação Genética , Hemípteros/fisiologia , Metaboloma/genética , Solanum/genética , Tisanópteros/fisiologia , Tricomas/genética , Tricomas/metabolismo , Algoritmos , Animais , Ecótipo , Genótipo , Fenótipo , Compostos Orgânicos Voláteis/análise
4.
Mol Genet Metab Rep ; 27: 100761, 2021 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-33996490

RESUMO

Guanidinoacetate methyltransferase (GAMT) deficiency is a creatine deficiency disorder and an inborn error of metabolism presenting with progressive intellectual and neurological deterioration. As most cases are identified and treated in early childhood, adult phenotypes that can help in understanding the natural history of the disorder are rare. We describe two adult cases of GAMT deficiency from a consanguineous family in Pakistan that presented with a history of global developmental delay, cognitive impairments, excessive drooling, behavioral abnormalities, contractures and apparent bone deformities initially presumed to be the reason for abnormal gait. Exome sequencing identified a homozygous nonsense variant in GAMT: NM_000156.5:c.134G>A (p.Trp45*). We also performed a literature review and compiled the genetic and clinical characteristics of all adult cases of GAMT deficiency reported to date. When compared to the adult cases previously reported, the musculoskeletal phenotype and the rapidly progressive nature of neurological and motor decline seen in our patients is striking. This study presents an opportunity to gain insights into the adult presentation of GAMT deficiency and highlights the need for in-depth evaluation and reporting of clinical features to expand our understanding of the phenotypic spectrum.

5.
Front Genet ; 12: 620453, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33747045

RESUMO

Technologies for profiling samples using different omics platforms have been at the forefront since the human genome project. Large-scale multi-omics data hold the promise of deciphering different regulatory layers. Yet, while there is a myriad of bioinformatics tools, each multi-omics analysis appears to start from scratch with an arbitrary decision over which tools to use and how to combine them. Therefore, it is an unmet need to conceptualize how to integrate such data and implement and validate pipelines in different cases. We have designed a conceptual framework (STATegra), aiming it to be as generic as possible for multi-omics analysis, combining available multi-omic anlaysis tools (machine learning component analysis, non-parametric data combination, and a multi-omics exploratory analysis) in a step-wise manner. While in several studies, we have previously combined those integrative tools, here, we provide a systematic description of the STATegra framework and its validation using two The Cancer Genome Atlas (TCGA) case studies. For both, the Glioblastoma and the Skin Cutaneous Melanoma (SKCM) cases, we demonstrate an enhanced capacity of the framework (and beyond the individual tools) to identify features and pathways compared to single-omics analysis. Such an integrative multi-omics analysis framework for identifying features and components facilitates the discovery of new biology. Finally, we provide several options for applying the STATegra framework when parametric assumptions are fulfilled and for the case when not all the samples are profiled for all omics. The STATegra framework is built using several tools, which are being integrated step-by-step as OpenSource in the STATegRa Bioconductor package.

7.
PLoS Comput Biol ; 16(9): e1008295, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32997685

RESUMO

The field of transcriptomics uses and measures mRNA as a proxy of gene expression. There are currently two major platforms in use for quantifying mRNA, microarray and RNA-Seq. Many comparative studies have shown that their results are not always consistent. In this study we aim to find a robust method to increase comparability of both platforms enabling data analysis of merged data from both platforms. We transformed high dimensional transcriptomics data from two different platforms into a lower dimensional, and biologically relevant dataset by calculating enrichment scores based on gene set collections for all samples. We compared the similarity between data from both platforms based on the raw data and on the enrichment scores. We show that the performed data transforms the data in a biologically relevant way and filters out noise which leads to increased platform concordance. We validate the procedure using predictive models built with microarray based enrichment scores to predict subtypes of breast cancer using enrichment scores based on sequenced data. Although microarray and RNA-Seq expression levels might appear different, transforming them into biologically relevant gene set enrichment scores significantly increases their correlation, which is a step forward in data integration of the two platforms. The gene set collections were shown to contain biologically relevant gene sets. More in-depth investigation on the effect of the composition, size, and number of gene sets that are used for the transformation is suggested for future research.


Assuntos
Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Análise de Sequência com Séries de Oligonucleotídeos , RNA-Seq , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Feminino , Humanos , Reprodutibilidade dos Testes , Transcriptoma/genética
8.
Front Cell Dev Biol ; 8: 520, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32671069

RESUMO

X-linked adrenoleukodystrophy (ALD) is a peroxisomal metabolic disorder with a highly complex clinical presentation. ALD is caused by mutations in the ABCD1 gene, and is characterized by the accumulation of very long-chain fatty acids in plasma and tissues. Disease-causing mutations are 'loss of function' mutations, with no prognostic value with respect to the clinical outcome of an individual. All male patients with ALD develop spinal cord disease and a peripheral neuropathy in adulthood, although age of onset is highly variable. However, the lifetime prevalence to develop progressive white matter lesions, termed cerebral ALD (CALD), is only about 60%. Early identification of transition to CALD is critical since it can be halted by allogeneic hematopoietic stem cell therapy only in an early stage. The primary goal of this study is to identify molecular markers which may be prognostic of cerebral demyelination from a simple blood sample, with the hope that blood-based assays can replace the current protocols for diagnosis. We collected six well-characterized brother pairs affected by ALD and discordant for the presence of CALD and performed multi-omic profiling of blood samples including genome, epigenome, transcriptome, metabolome/lipidome, and proteome profiling. In our analysis we identify discordant genomic alleles present across all families as well as differentially abundant molecular features across the omics technologies. The analysis was focused on univariate modeling to discriminate the two phenotypic groups, but was unable to identify statistically significant candidate molecular markers. Our study highlights the issues caused by a large amount of inter-individual variation, and supports the emerging hypothesis that cerebral demyelination is a complex mix of environmental factors and/or heterogeneous genomic alleles. We confirm previous observations about the role of immune response, specifically auto-immunity and the potential role of PFN1 protein overabundance in CALD in a subset of the families. We envision our methodology as well as dataset has utility to the field for reproducing previous or enabling future modifier investigations.

9.
Sci Data ; 6(1): 256, 2019 10 31.
Artigo em Inglês | MEDLINE | ID: mdl-31672995

RESUMO

Multi-omics approaches use a diversity of high-throughput technologies to profile the different molecular layers of living cells. Ideally, the integration of this information should result in comprehensive systems models of cellular physiology and regulation. However, most multi-omics projects still include a limited number of molecular assays and there have been very few multi-omic studies that evaluate dynamic processes such as cellular growth, development and adaptation. Hence, we lack formal analysis methods and comprehensive multi-omics datasets that can be leveraged to develop true multi-layered models for dynamic cellular systems. Here we present the STATegra multi-omics dataset that combines measurements from up to 10 different omics technologies applied to the same biological system, namely the well-studied mouse pre-B-cell differentiation. STATegra includes high-throughput measurements of chromatin structure, gene expression, proteomics and metabolomics, and it is complemented with single-cell data. To our knowledge, the STATegra collection is the most diverse multi-omics dataset describing a dynamic biological system.


Assuntos
Linfócitos B , Diferenciação Celular , Animais , Linfócitos B/citologia , Linfócitos B/fisiologia , Linhagem Celular , Genômica , Metabolômica , Camundongos , Proteômica
10.
BMC Bioinformatics ; 17 Suppl 5: 195, 2016 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-27294690

RESUMO

BACKGROUND: Joint and individual variation explained (JIVE), distinct and common simultaneous component analysis (DISCO) and O2-PLS, a two-block (X-Y) latent variable regression method with an integral OSC filter can all be used for the integrated analysis of multiple data sets and decompose them in three terms: a low(er)-rank approximation capturing common variation across data sets, low(er)-rank approximations for structured variation distinctive for each data set, and residual noise. In this paper these three methods are compared with respect to their mathematical properties and their respective ways of defining common and distinctive variation. RESULTS: The methods are all applied on simulated data and mRNA and miRNA data-sets from GlioBlastoma Multiform (GBM) brain tumors to examine their overlap and differences. When the common variation is abundant, all methods are able to find the correct solution. With real data however, complexities in the data are treated differently by the three methods. CONCLUSIONS: All three methods have their own approach to estimate common and distinctive variation with their specific strength and weaknesses. Due to their orthogonality properties and their used algorithms their view on the data is slightly different. By assuming orthogonality between common and distinctive, true natural or biological phenomena that may not be orthogonal at all might be misinterpreted.


Assuntos
Algoritmos , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/metabolismo , Neoplasias Encefálicas/patologia , Glioblastoma/genética , Glioblastoma/metabolismo , Glioblastoma/patologia , Humanos , MicroRNAs/metabolismo , Análise de Componente Principal , RNA Mensageiro/metabolismo
11.
Anal Chem ; 87(12): 5921-9, 2015 Jun 16.
Artigo em Inglês | MEDLINE | ID: mdl-25965142

RESUMO

The possible presence of matrix effect is one of the main concerns in liquid chromatography-mass spectrometry (LC-MS)-driven bioanalysis due to its impact on the reliability of the obtained quantitative results. Here we propose an approach to correct for the matrix effect in LC-MS with electrospray ionization using postcolumn infusion of eight internal standards (PCI-IS). We applied this approach to a generic ultraperformance liquid chromatography-time-of-flight (UHPLC-TOF) platform developed for small-molecule profiling with a main focus on drugs. Different urine samples were spiked with 19 drugs with different physicochemical properties and analyzed in order to study matrix effect (in absolute and relative terms). Furthermore, calibration curves for each analyte were constructed and quality control samples at different concentration levels were analyzed to check the applicability of this approach in quantitative analysis. The matrix effect profiles of the PCI-ISs were different: this confirms that the matrix effect is compound-dependent, and therefore the most suitable PCI-IS has to be chosen for each analyte. Chromatograms were reconstructed using analyte and PCI-IS responses, which were used to develop an optimized method which compensates for variation in ionization efficiency. The approach presented here improved the results in terms of matrix effect dramatically. Furthermore, calibration curves of higher quality are obtained, dynamic range is enhanced, and accuracy and precision of QC samples is increased. The use of PCI-ISs is a very promising step toward an analytical platform free of matrix effect, which can make LC-MS analysis even more successful, adding a higher reliability in quantification to its intrinsic high sensitivity and selectivity.


Assuntos
Preparações Farmacêuticas/urina , Acetaminofen/urina , Benzimidazóis/urina , Benzoatos/urina , Compostos de Bifenilo , Cromatografia Líquida de Alta Pressão/instrumentação , Clomipramina/urina , Di-Hidropiridinas/urina , Encefalina Leucina/urina , Humanos , Espectrometria de Massas/instrumentação , Nifedipino/urina , Sinvastatina/urina , Telmisartan , Tetrazóis/urina , Fatores de Tempo
12.
Mol Nutr Food Res ; 58(11): 2111-21, 2014 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-25045152

RESUMO

SCOPE: Genistein from foods or supplements is metabolized by the gut microbiota and the human body, thereby releasing many different metabolites into systemic circulation. The order of their appearance in plasma and the possible influence of food format are still unknown. This study compared the nutrikinetic profiles of genistein metabolites. METHODS AND RESULTS: In a randomized cross-over trial, 12 healthy young volunteers were administered a single dose of 30 mg genistein provided as a genistein tablet, a genistein tablet in low fat milk, and soy milk containing genistein glycosides. A high mass resolution LC-LTQ-Orbitrap FTMS platform detected and quantified in human plasma: free genistein, seven of its phase-II metabolites and 15 gut-derived metabolites. Interestingly, a novel metabolite, genistein-4'-glucuronide-7-sulfate (G-4'G-7S) was identified. Nutrikinetic analysis using population-based modeling revealed the order of appearance of five genistein phase II metabolites in plasma: (1) genistein-4',7-diglucuronide, (2) genistein-7-sulfate, (3) genistein-4'-sulfate-7-glucuronide, (4) genistein-4'-glucuronide, and (5) genistein-7-glucuronide, independent of the food matrix. CONCLUSION: The conjugated genistein metabolites appear in a distinct order in human plasma. The specific early appearance of G-4',7-diG suggests a multistep formation process for the mono and hetero genistein conjugates, involving one or two deglucuronidation steps.


Assuntos
Genisteína/análogos & derivados , Administração Oral , Adolescente , Adulto , Animais , Índice de Massa Corporal , Cromatografia Líquida de Alta Pressão , Cromatografia Líquida , Estudos Cross-Over , Relação Dose-Resposta a Droga , Feminino , Genisteína/administração & dosagem , Genisteína/sangue , Genisteína/farmacocinética , Voluntários Saudáveis , Humanos , Masculino , Espectrometria de Massas , Leite/química , Leite de Soja/química , Adulto Jovem
13.
Anal Chim Acta ; 801: 34-42, 2013 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-24139572

RESUMO

Because of its high sensitivity and specificity, hyphenated mass spectrometry has become the predominant method to detect and quantify metabolites present in bio-samples relevant for all sorts of life science studies being executed. In contrast to targeted methods that are dedicated to specific features, global profiling acquisition methods allow new unspecific metabolites to be analyzed. The challenge with these so-called untargeted methods is the proper and automated extraction and integration of features that could be of relevance. We propose a new algorithm that enables untargeted integration of samples that are measured with high resolution liquid chromatography-mass spectrometry (LC-MS). In contrast to other approaches limited user interaction is needed allowing also less experienced users to integrate their data. The large amount of single features that are found within a sample is combined to a smaller list of, compound-related, grouped feature-sets representative for that sample. These feature-sets allow for easier interpretation and identification and as important, easier matching over samples. We show that the automatic obtained integration results for a set of known target metabolites match those generated with vendor software but that at least 10 times more feature-sets are extracted as well. We demonstrate our approach using high resolution LC-MS data acquired for 128 samples on a lipidomics platform. The data was also processed in a targeted manner (with a combination of automatic and manual integration) using vendor software for a set of 174 targets. As our untargeted extraction procedure is run per sample and per mass trace the implementation of it is scalable. Because of the generic approach, we envision that this data extraction lipids method will be used in a targeted as well as untargeted analysis of many different kinds of TOF-MS data, even CE- and GC-MS data or MRM. The Matlab package is available for download on request and efforts are directed toward a user-friendly Windows executable.


Assuntos
Algoritmos , Cromatografia Líquida de Alta Pressão , Espectrometria de Massas , Estatística como Assunto/métodos , Software
14.
Aging Cell ; 12(3): 426-34, 2013 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-23451766

RESUMO

Middle-aged offspring of nonagenarians, as compared to their spouses (controls), show a favorable lipid metabolism marked by larger LDL particle size in men and lower total triglyceride levels in women. To investigate which specific lipids associate with familial longevity, we explore the plasma lipidome by measuring 128 lipid species using liquid chromatography coupled to mass spectrometry in 1526 offspring of nonagenarians (59 years ± 6.6) and 675 (59 years ± 7.4) controls from the Leiden Longevity Study. In men, no significant differences were observed between offspring and controls. In women, however, 19 lipid species associated with familial longevity. Female offspring showed higher levels of ether phosphocholine (PC) and sphingomyelin (SM) species (3.5-8.7%) and lower levels of phosphoethanolamine PE (38:6) and long-chain triglycerides (TG) (9.4-12.4%). The association with familial longevity of two ether PC and four SM species was independent of total triglyceride levels. In addition, the longevity-associated lipid profile was characterized by a higher ratio of monounsaturated (MUFA) over polyunsaturated (PUFA) lipid species, suggesting that female offspring have a plasma lipidome less prone to oxidative stress. Ether PC and SM species were identified as novel longevity markers in females, independent of total triglycerides levels. Several longevity-associated lipids correlated with a lower risk of hypertension and diabetes in the Leiden Longevity Study cohort. This sex-specific lipid signature marks familial longevity and may suggest a plasma lipidome with a better antioxidant capacity, lower lipid peroxidation and inflammatory precursors, and an efficient beta-oxidation function.


Assuntos
Envelhecimento/sangue , Metabolismo dos Lipídeos , Lipídeos/sangue , Longevidade , Adulto , Idoso , Idoso de 80 Anos ou mais , Antioxidantes/metabolismo , Biomarcadores/sangue , Cromatografia Líquida , Estudos de Coortes , Etanolaminas/sangue , Feminino , Humanos , Masculino , Espectrometria de Massas , Pessoa de Meia-Idade , Estresse Oxidativo , Fosforilcolina/sangue , Fatores Sexuais , Esfingomielinas/sangue , Triglicerídeos/sangue
15.
Metabolomics ; 7(1): 1-14, 2011 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-21461033

RESUMO

Due to the complexity of typical metabolomics samples and the many steps required to obtain quantitative data in GC × GC-MS consisting of deconvolution, peak picking, peak merging, and integration, the unbiased non-target quantification of GC × GC-MS data still poses a major challenge in metabolomics analysis. The feasibility of using commercially available software for non-target processing of GC × GC-MS data was assessed. For this purpose a set of mouse liver samples (24 study samples and five quality control (QC) samples prepared from the study samples) were measured with GC × GC-MS and GC-MS to study the development and progression of insulin resistance, a primary characteristic of diabetes type 2. A total of 170 and 691 peaks were quantified in, respectively, the GC-MS and GC × GC-MS data for all study and QC samples. The quantitative results for the QC samples were compared to assess the quality of semi-automated GC × GC-MS processing compared to targeted GC-MS processing which involved time-consuming manual correction of all wrongly integrated metabolites and was considered as golden standard. The relative standard deviations (RSDs) obtained with GC × GC-MS were somewhat higher than with GC-MS, due to less accurate processing. Still, the biological information in the study samples was preserved and the added value of GC × GC-MS was demonstrated; many additional candidate biomarkers were found with GC × GC-MS compared to GC-MS. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-010-0219-6) contains supplementary material, which is available to authorized users.

16.
Anal Chem ; 82(3): 1039-46, 2010 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-20052990

RESUMO

Combination of data sets from different objects (for example, from two groups of healthy volunteers from the same population) that were measured on a common set of variables (for example, metabolites or peptides) is desirable for statistical analysis in "omics" studies because it increases power. However, this type of combination is not directly possible if nonbiological systematic differences exist among the individual data sets, or "blocks". Such differences can, for example, be due to small analytical changes that are likely to accumulate over large time intervals between blocks of measurements. In this article we present a data transformation method, that we will refer to as "quantile equating", which per variable corrects for linear and nonlinear differences in distribution among blocks of semiquantitative data obtained with the same analytical method. We demonstrate the successful application of the quantile equating method to data obtained on two typical metabolomics platforms, i.e., liquid chromatography-mass spectrometry and nuclear magnetic resonance spectroscopy. We suggest uni- and multivariate methods to evaluate similarities and differences among data blocks before and after quantile equating. In conclusion, we have developed a method to correct for nonbiological systematic differences among semiquantitative data blocks and have demonstrated its successful application to metabolomics data sets.


Assuntos
Cromatografia Líquida de Alta Pressão/métodos , Lipídeos/sangue , Espectroscopia de Ressonância Magnética/métodos , Espectrometria de Massas/métodos , Metabolômica/métodos , Adolescente , Algoritmos , Estudos de Coortes , Feminino , Humanos , Lipídeos/química , Masculino , Análise de Componente Principal , Irmãos , Gêmeos , Adulto Jovem
17.
J Proteome Res ; 8(11): 5132-41, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19754161

RESUMO

Analytical errors caused by suboptimal performance of the chosen platform for a number of metabolites and instrumental drift are a major issue in large-scale metabolomics studies. Especially for MS-based methods, which are gaining common ground within metabolomics, it is difficult to control the analytical data quality without the availability of suitable labeled internal standards and calibration standards even within one laboratory. In this paper, we suggest a workflow for significant reduction of the analytical error using pooled calibration samples and multiple internal standard strategy. Between and within batch calibration techniques are applied and the analytical error is reduced significantly (increase of 25% of peaks with RSD lower than 20%) and does not hamper or interfere with statistical analysis of the final data.


Assuntos
Metabolômica , Algoritmos , Calibragem/normas , Espectrometria de Massas/métodos , Metabolômica/métodos , Metabolômica/normas , Fenótipo , Controle de Qualidade , Reprodutibilidade dos Testes
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA