Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 315
Filtrar
1.
Nat Commun ; 13(1): 744, 2022 02 08.
Artículo en Inglés | MEDLINE | ID: mdl-35136070

RESUMEN

The integration of genomics and proteomics data (proteogenomics) holds the promise of furthering the in-depth understanding of human disease. However, sample mix-up is a pervasive problem in proteogenomics because of the complexity of sample processing. Here, we present a pipeline for Sample Matching in Proteogenomics (SMAP) to verify sample identity and ensure data integrity. SMAP infers sample-dependent protein-coding variants from quantitative mass spectrometry (MS), and aligns the MS-based proteomic samples with genomic samples by two discriminant scores. Theoretical analysis with simulated data indicates that SMAP is capable of uniquely matching proteomic and genomic samples when ≥20% genotypes of individual samples are available. When SMAP was applied to a large-scale dataset generated by the PsychENCODE BrainGVEX project, 54 samples (19%) were corrected. The correction was further confirmed by ribosome profiling and chromatin sequencing (ATAC-seq) data from the same set of samples. Our results demonstrate that SMAP is an effective tool for sample verification in a large-scale MS-based proteogenomics study. SMAP is publicly available at https://github.com/UND-Wanglab/SMAP , and a web-based version can be accessed at https://smap.shinyapps.io/smap/ .


Asunto(s)
Conjuntos de Datos como Asunto , Proteogenómica/métodos , Secuenciación de Inmunoprecipitación de Cromatina , Análisis de Datos , Femenino , Humanos , Masculino , Espectrometría de Masas/métodos , Espectrometría de Masas/estadística & datos numéricos , Proteogenómica/estadística & datos numéricos , RNA-Seq , Programas Informáticos , Secuenciación Completa del Genoma
2.
J Hepatol ; 75(6): 1301-1311, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34271004

RESUMEN

BACKGROUND & AIMS: Patients with HCV who achieve a sustained virological response (SVR) on direct-acting antiviral (DAA) therapy still need to be monitored for signs of liver disease progression. To this end, the identification of both disease biomarkers and therapeutic targets is necessary. METHODS: Extracellular vesicles (EVs) purified from plasma of 15 healthy donors (HDs), and 16 HCV-infected patients before (T0) and after (T6) DAA treatment were utilized for functional and miRNA cargo analysis. EVs purified from plasma of 17 HDs and 23 HCV-infected patients (T0 and T6) were employed for proteomic and western blot analyses. Functional analysis in LX2 cells measured fibrotic markers (mRNAs and proteins) in response to EVs. Structural analysis was performed by qPCR, label-free liquid chromatography-mass spectrometry and western blot. RESULTS: On the basis of observations indicating functional differences (i.e. modulation of FN-1, ACTA2, Smad2/3 phosphorylation, collagen deposition) of plasma-derived EVs from HDs, T0 and T6, we performed structural analysis of EVs. We found consistent differences in terms of both miRNA and protein cargos: (i) antifibrogenic miR204-5p, miR181a-5p, miR143-3p, miR93-5p and miR122-5p were statistically underrepresented in T0 EVs compared to HD EVs, while miR204-5p and miR143-3p were statistically underrepresented in T6 EVs compared to HD EVs (p <0.05); (ii) proteomic analysis highlighted, in both T0 and T6, the modulation of several proteins with respect to HDs; among them, the fibrogenic protein DIAPH1 was upregulated (Log2 fold change of 4.4). CONCLUSIONS: Taken together, these results highlight structural EV modifications that are conceivably causal for long-term liver disease progression in patients with HCV despite DAA-mediated SVR. LAY SUMMARY: Direct-acting antivirals lead to virological cure in the majority of patients with chronic hepatitis C virus infection. However, the risk of liver disease progression or complications in patients with fibrosis and cirrhosis remains in some patients even after virological cure. Herein, we show that extracellular vesicle modifications could be linked to long-term liver disease progression in patients who have achieved virological cure; these modifications could potentially be used as biomarkers or treatment targets in such patients.


Asunto(s)
Antivirales/farmacología , Hepacivirus/fisiología , Hepatitis C/tratamiento farmacológico , Respuesta Virológica Sostenida , Antivirales/uso terapéutico , Comunicación Celular/efectos de los fármacos , Comunicación Celular/fisiología , Hepatitis C/fisiopatología , Humanos , Espectrometría de Masas/métodos , Espectrometría de Masas/estadística & datos numéricos
3.
J Am Soc Mass Spectrom ; 32(8): 2110-2122, 2021 Aug 04.
Artículo en Inglés | MEDLINE | ID: mdl-34190546

RESUMEN

Liquid chromatography-mass spectrometry (LC-MS) is one of the most widely used analytical tools. High analysis volumes and sample complexity often demand more informative LC-MS acquisition schemes to improve efficiency and throughput without compromising data quality, and such a demand has been always hindered by the prerequisite that a minimum of 13-20 MS scans (data points) across an analyte peak are required for accurate quantitation. The current study systematically re-evaluated and compared the impact of different scan numbers on quantitation analysis using both triple quadrupoles mass spectrometry (TQMS) and high-resolution mass spectrometry (HRMS). Contrary to the 13-20 minimal scan prerequisite, the data obtained from a group of eight commercial drugs in the absence and presence of biological matrices suggest that 6 scans per analyte peak are sufficient to achieve highly comparable quantitation results compared to that obtained using 10 and 20 scans, respectively. The fewer minimal scan prerequisite is presumably attributed to an improved LC system and advanced column technology, better MS detector, and more intelligent peak detection and integration algorithms leading to a more symmetric peak shape and smaller peak standard deviation. As a result, more informative acquisition schemes can be broadly set up for higher throughput and more data-rich LC-MS/MS analysis as demonstrated in a hepatocyte clearance assay in which fewer MS scans executed on HRMS led to broader metabolite coverage without compromising data quality in hepatic clearance assessment. The demonstrated acquisition scheme would substantially increase the throughput, robustness, and richness of the nonregulatory analysis, which can be broadly applied in diverse fields including pharmaceutical, environmental, forensic, toxicological, and biotechnological.


Asunto(s)
Cromatografía Liquida/métodos , Inactivación Metabólica , Espectrometría de Masas/métodos , Preparaciones Farmacéuticas/análisis , Animales , Cromatografía Liquida/estadística & datos numéricos , Perros , Haplorrinos , Hepatocitos/efectos de los fármacos , Hepatocitos/metabolismo , Humanos , Espectrometría de Masas/estadística & datos numéricos , Ratones , Preparaciones Farmacéuticas/química , Farmacocinética , Ratas , Reproducibilidad de los Resultados , Sensibilidad y Especificidad
4.
J Am Soc Mass Spectrom ; 32(6): 1278-1294, 2021 Jun 02.
Artículo en Inglés | MEDLINE | ID: mdl-33983025

RESUMEN

Top-down mass spectrometry (MS)-based proteomics is a powerful technology for comprehensively characterizing proteoforms to decipher post-translational modifications (PTMs) together with genetic variations and alternative splicing isoforms toward a proteome-wide understanding of protein functions. In the past decade, top-down proteomics has experienced rapid growth benefiting from groundbreaking technological advances, which have begun to reveal the potential of top-down proteomics for understanding basic biological functions, unraveling disease mechanisms, and discovering new biomarkers. However, many challenges remain to be comprehensively addressed. In this Account & Perspective, we discuss the major challenges currently facing the top-down proteomics field, particularly in protein solubility, proteome dynamic range, proteome complexity, data analysis, proteoform-function relationship, and analytical throughput for precision medicine. We specifically review the major technology developments addressing these challenges with an emphasis on our research group's efforts, including the development of top-down MS-compatible surfactants for protein solubilization, functionalized nanoparticles for the enrichment of low-abundance proteoforms, strategies for multidimensional chromatography separation of proteins, and a new comprehensive user-friendly software package for top-down proteomics. We have also made efforts to connect proteoforms with biological functions and provide our visions on what the future holds for top-down proteomics.


Asunto(s)
Espectrometría de Masas/métodos , Proteínas/química , Proteoma/análisis , Proteómica/métodos , Humanos , Espectrometría de Masas/estadística & datos numéricos , Medicina de Precisión/métodos , Procesamiento Proteico-Postraduccional , Proteínas/metabolismo , Proteoma/metabolismo , Programas Informáticos , Solubilidad
5.
Methods Mol Biol ; 2228: 1-20, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33950479

RESUMEN

Mass spectrometry is frequently used in quantitative proteomics to detect differentially regulated proteins. A very important but unfortunately oftentimes neglected part in detecting differential proteins is the statistical analysis. Data from proteomics experiments are usually high-dimensional and hence require profound statistical methods. It is especially important to already correctly design a proteomic experiment before it is conducted in the laboratory. Only this can ensure that the statistical analysis is capable of detecting truly differential proteins afterward. This chapter thus covers aspects of both statistical planning as well as the actual analysis of quantitative proteomic experiments.


Asunto(s)
Espectrometría de Masas/estadística & datos numéricos , Proteínas/análisis , Proteoma , Proteómica/estadística & datos numéricos , Proyectos de Investigación/estadística & datos numéricos , Animales , Interpretación Estadística de Datos , Humanos , Modelos Estadísticos
6.
J Proteome Res ; 20(3): 1464-1475, 2021 03 05.
Artículo en Inglés | MEDLINE | ID: mdl-33605735

RESUMEN

The SARS-CoV-2 virus is the causative agent of the 2020 pandemic leading to the COVID-19 respiratory disease. With many scientific and humanitarian efforts ongoing to develop diagnostic tests, vaccines, and treatments for COVID-19, and to prevent the spread of SARS-CoV-2, mass spectrometry research, including proteomics, is playing a role in determining the biology of this viral infection. Proteomics studies are starting to lead to an understanding of the roles of viral and host proteins during SARS-CoV-2 infection, their protein-protein interactions, and post-translational modifications. This is beginning to provide insights into potential therapeutic targets or diagnostic strategies that can be used to reduce the long-term burden of the pandemic. However, the extraordinary situation caused by the global pandemic is also highlighting the need to improve mass spectrometry data and workflow sharing. We therefore describe freely available data and computational resources that can facilitate and assist the mass spectrometry-based analysis of SARS-CoV-2. We exemplify this by reanalyzing a virus-host interactome data set to detect protein-protein interactions and identify host proteins that could potentially be used as targets for drug repurposing.


Asunto(s)
COVID-19/virología , Difusión de la Información/métodos , Espectrometría de Masas/métodos , SARS-CoV-2/química , COVID-19/epidemiología , Prueba de COVID-19/métodos , Prueba de COVID-19/estadística & datos numéricos , Biología Computacional , Bases de Datos de Proteínas/estadística & datos numéricos , Reposicionamiento de Medicamentos , Interacciones Microbiota-Huesped/fisiología , Humanos , Espectrometría de Masas/estadística & datos numéricos , Pandemias , Dominios y Motivos de Interacción de Proteínas , Mapas de Interacción de Proteínas , Procesamiento Proteico-Postraduccional , Proteómica/métodos , Proteómica/estadística & datos numéricos , SARS-CoV-2/patogenicidad , SARS-CoV-2/fisiología , Proteínas Virales/química , Proteínas Virales/fisiología , Tratamiento Farmacológico de COVID-19
7.
J Mass Spectrom ; 56(1): e4658, 2021 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-33084147

RESUMEN

Metabolism is the set of life-sustaining reactions in organisms. These biochemical reactions are organized in metabolic pathways, in which one metabolite is converted through a series of steps catalyzed by enzymes in another chemical compound. Metabolic reactions are categorized as catabolic, the breaking down of metabolites to produce energy, and/or anabolic, the synthesis of compounds that consume energy. The balance between catabolism of the preferential fuel substrate and anabolism defines the overall metabolism of a cell or tissue. Metabolomics is a powerful tool to gain new insights contributing to the identification of complex molecular mechanisms in the field of biomedical research, both basic and translational. The enormous potential of this kind of analyses consists of two key aspects: (i) the possibility of performing so-called targeted and untargeted experiments through which it is feasible to verify or formulate a hypothesis, respectively, and (ii) the opportunity to run either steady-state analyses to have snapshots of the metabolome at a given time under different experimental conditions or dynamic analyses through the use of labeled tracers. In this review, we will highlight the most important practical (e.g., different sample extraction approaches) and conceptual steps to consider for metabolomic analysis, describing also the main application contexts in which it is used. In addition, we will provide some insights into the most innovative approaches and progress in the field of data analysis and processing, highlighting how this part is essential for the proper extrapolation and interpretation of data.


Asunto(s)
Espectrometría de Masas/métodos , Metabolómica/métodos , Animales , Congelación , Humanos , Espectrometría de Masas/estadística & datos numéricos , Metabolómica/estadística & datos numéricos , Solubilidad , Manejo de Especímenes
8.
Nucleic Acids Res ; 49(D1): D1523-D1528, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33174597

RESUMEN

For the reproducibility and sustainability of scientific research, FAIRness (Findable, Accessible, Interoperable and Re-usable), with respect to the release of raw data obtained by researchers, is one of the most important principles underpinning the future of open science. In genomics and transcriptomics, the sharing of raw data from next-generation sequencers is made possible through public repositories. In addition, in proteomics, the deposition of raw data from mass spectrometry (MS) experiments into repositories is becoming standardized. However, a standard repository for such MS data had not yet been established in glycomics. With the increasing number of glycomics MS data, therefore, we have developed GlycoPOST (https://glycopost.glycosmos.org/), a repository for raw MS data generated from glycomics experiments. In just the first year since the release of GlycoPOST, 73 projects have already been registered by researchers around the world, and the number of registered projects is continuously growing, making a significant contribution to the future FAIRness of the glycomics field. GlycoPOST is a free resource to the community and accepts (and will continue to accept in the future) raw data regardless of vendor-specific formats.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Factuales , Glicómica/métodos , Espectrometría de Masas/estadística & datos numéricos , Programas Informáticos , Glicómica/normas , Humanos , Difusión de la Información/ética , Internet , Espectrometría de Masas/métodos , Espectrometría de Masas/normas , Reproducibilidad de los Resultados , Manejo de Especímenes/métodos , Manejo de Especímenes/normas
9.
Nat Commun ; 11(1): 5595, 2020 11 05.
Artículo en Inglés | MEDLINE | ID: mdl-33154370

RESUMEN

Rapid and accurate clinical diagnosis remains challenging. A component of diagnosis tool development is the design of effective classification models with Mass spectrometry (MS) data. Some Machine Learning approaches have been investigated but these models require time-consuming preprocessing steps to remove artifacts, making them unsuitable for rapid analysis. Convolutional Neural Networks (CNNs) have been found to perform well under such circumstances since they can learn representations from raw data. However, their effectiveness decreases when the number of available training samples is small, which is a common situation in medicine. In this work, we investigate transfer learning on 1D-CNNs, then we develop a cumulative learning method when transfer learning is not powerful enough. We propose to train the same model through several classification tasks over various small datasets to accumulate knowledge in the resulting representation. By using rat brain as the initial training dataset, a cumulative learning approach can have a classification accuracy exceeding 98% for 1D clinical MS-data. We show the use of cumulative learning using datasets generated in different biological contexts, on different organisms, and acquired by different instruments. Here we show a promising strategy for improving MS data classification accuracy when only small numbers of samples are available.


Asunto(s)
Aprendizaje Profundo , Espectrometría de Masas/métodos , Redes Neurales de la Computación , Animales , Bases de Datos Factuales , Diagnóstico por Computador , Humanos , Aprendizaje Automático , Espectrometría de Masas/estadística & datos numéricos
10.
Anal Chem ; 92(16): 10872-10880, 2020 08 18.
Artículo en Inglés | MEDLINE | ID: mdl-32667808

RESUMEN

Native mass spectrometry (MS) allows the interrogation of structural aspects of macromolecules in the gas phase, under the premise of having initially maintained their solution-phase noncovalent interactions intact. In the more than 25 years since the first reports, the utility of native MS has become well established in the structural biology community. The experimental and technological advances during this time have been rapid, resulting in dramatic increases in sensitivity, mass range, resolution, and complexity of possible experiments. As experimental methods have improved, there have been accompanying developments in computational approaches for analyzing and exploiting the profusion of MS data in a structural and biophysical context. In this perspective, we consider the computational strategies currently being employed by the community, aspects of best practice, and the challenges that remain to be addressed. Our perspective is based on discussions within the European Cooperation in Science and Technology Action on Native Mass Spectrometry and Related Methods for Structural Biology (EU COST Action BM1403), which involved participants from across Europe and North America. It is intended not as an in-depth review but instead to provide an accessible introduction to and overview of the topic-to inform newcomers to the field and stimulate discussions in the community about addressing existing challenges. Our complementary perspective (http://dx.doi.org/10.1021/acs.analchem.9b05792) focuses on software tools available to help researchers tackle some of the challenges enumerated here.


Asunto(s)
Biofisica/métodos , Biología Computacional/métodos , Espectrometría de Masas/estadística & datos numéricos , Espectrometría de Masas/métodos , Proteínas/análisis
11.
Anal Chem ; 92(16): 11155-11163, 2020 08 18.
Artículo en Inglés | MEDLINE | ID: mdl-32662991

RESUMEN

Rotationally averaged collision cross section (CCS) values for a series of proteins and protein complexes ranging in size from 8.6 to 810 kDa are reported. The CCSs were obtained using a native electrospray ionization drift tube ion mobility-Orbitrap mass spectrometer specifically designed to enhance sensitivity while having high-resolution ion mobility and mass capabilities. Periodic focusing (PF)-drift tube (DT)-ion mobility (IM) provides first-principles determination of the CCS of large biomolecules that can then be used as CCS calibrants. The experimental, first-principles CCS values are compared to previously reported experimentally determined and computationally calculated CCS using projected superposition approximation (PSA), the Ion Mobility Projection Approximation Calculation Tool (IMPACT), and Collidoscope. Experimental CCS values are generally in agreement with previously reported CCSs, with values falling within ∼5.5%. In addition, an ion mobility resolution (CCS centroid divided by CCS fwhm) of ∼60 is obtained for pyruvate kinase (MW ∼ 233 kDa); however, ion mobility resolution for bovine serum albumin (MW ∼ 68 kDa) is less than ∼20, which arises from sample impurities and underscores the importance of sample quality. The high resolution afforded by the ion mobility-Orbitrap mass analyzer provides new opportunities to understand the intricate details of protein complexes such as the impact of post-translational modifications (PTMs), stoichiometry, and conformational changes induced by ligand binding.


Asunto(s)
Proteínas/química , Animales , Bovinos , Espectrometría de Movilidad Iónica/métodos , Espectrometría de Movilidad Iónica/estadística & datos numéricos , Espectrometría de Masas/métodos , Espectrometría de Masas/estadística & datos numéricos , Estructura Cuaternaria de Proteína , Conejos
13.
Biochem Biophys Res Commun ; 525(4): 863-869, 2020 05 14.
Artículo en Inglés | MEDLINE | ID: mdl-32171522

RESUMEN

Evidences suggest that dietary docosahexaenoic acid (DHA) supplementation may have pleiotropic beneficial effects on health. However, the underlying mechanisms and crucial targets that are involved in achieving these benefits remain to be clarified. In this study, we employed biochemical analysis and liquid chromatography-mass spectrometry (LC-MS) based untargeted metabolomics coupled with multivariate statistical analysis to identify potential metabolic targets of DHA in adult rats at 48 h post-feeding. Blood biochemical analysis showed a significant decrease in triglyceride level of DHA diet group, the untargeted metabolomic analysis revealed that some metabolites were significantly different between the DHA diet group and the basal diet group, including fatty acids (16:0, 18:1, 20:5n3, 22:2n6 and 24:0), diglyceride (20:0/18:2n6, 18:3n6/22:6n3, 20:4n3/20:4n3, and 22:0/24:0), PIP2 (18:2/20:3), phytol, lysoSM (d18:1), 12-hydroxyheptadecatrienoic acid, dihydrocorticosterone and N1-acetylspermine, which are mainly involved in fat mobilization and triglyceride hydrolysis, arachidonic acid, steroid hormone, and polyamine metabolism. To our knowledge, this is the first report that links the metabolic effects of DHA with arachidonic acid, steroid, and polyamine metabolism. Our finding suggests that the beneficial effects of DHA, may not directly require its own metabolic derivatives, but could be achieved by metabolic regulation.


Asunto(s)
Ácido Araquidónico/sangre , Ácidos Docosahexaenoicos/metabolismo , Ácidos Docosahexaenoicos/farmacología , Triglicéridos/sangre , Animales , Análisis Químico de la Sangre , Cromatografía Liquida , Suplementos Dietéticos , Ácidos Docosahexaenoicos/sangre , Análisis de los Mínimos Cuadrados , Espectrometría de Masas/estadística & datos numéricos , Poliaminas/sangre , Ratas , Reproducibilidad de los Resultados
14.
Food Chem ; 315: 126248, 2020 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-32018076

RESUMEN

Chianti is a precious red wine and enjoys a high reputation for its high quality in the world wine market. Despite this, the production region is small and product needs efficient tools to protect its brands and prevent adulterations. In this sense, ICP-MS combined with chemometrics has demonstrated its usefulness in food authentication. In this study, Chianti/Chianti Classico, authentic wines from vineyard of Toscana region (Italy), together samples from 18 different geographical regions, were analyzed with the objective of differentiate them from other Italian wines. Partial Least Squares-Discriminant Analysis (PLS-DA) identified variables to discriminate wine geographical origin. Rare Earth Elements (REE), major and trace elements all contributed to the discrimination of Chianti samples. General model was not suited to distinguish PDO red wines from samples, with similar chemical fingerprints, collected in some regions. Specific classification models enhanced the capability of discrimination, emphasizing the discriminant role of some elements.


Asunto(s)
Análisis de los Alimentos/métodos , Espectrometría de Masas/métodos , Vino/análisis , Análisis Discriminante , Análisis de los Alimentos/estadística & datos numéricos , Italia , Análisis de los Mínimos Cuadrados , Límite de Detección , Espectrometría de Masas/estadística & datos numéricos , Metales de Tierras Raras/análisis , Oligoelementos/análisis
15.
Food Chem ; 315: 126247, 2020 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-32006866

RESUMEN

Three non-targeted methods, i.e. 1H NMR, LC-HRMS, and HS-SPME/MS-eNose, combined with chemometrics, were used to classify two table grape cultivars (Italia and Victoria) based on five quality levels (5, 4, 3, 2, 1). Grapes at marketable quality levels (5, 4, 3) were also discriminated from non-marketable quality levels (2 and 1). PCA-LDA and PLS-DA were applied, and results showed that, the MS-eNose provided the best results. Specifically, with the Italia table grapes, mean prediction abilities ranging from 87% to 88% and from 98% to 99% were obtained for discrimination amongst the five quality levels and of marketability/non-marketability, respectively. For the cultivar Victoria, mean predictive abilities higher than 99% were achieved for both classifications. Good models were also obtained for both cultivars using NMR and HRMS data, but only for classification by marketability. Satisfying models were further validated by MCCV. Finally, the compounds that contributed the most to the discriminations were identified.


Asunto(s)
Análisis de los Alimentos/métodos , Almacenamiento de Alimentos , Espectroscopía de Protones por Resonancia Magnética/métodos , Vitis/química , Nariz Electrónica/estadística & datos numéricos , Análisis de los Alimentos/estadística & datos numéricos , Calidad de los Alimentos , Análisis de los Mínimos Cuadrados , Espectrometría de Masas/métodos , Espectrometría de Masas/estadística & datos numéricos , Análisis Multivariante , Análisis de Componente Principal , Espectroscopía de Protones por Resonancia Magnética/estadística & datos numéricos , Compuestos Orgánicos Volátiles/análisis
16.
Sci Rep ; 10(1): 876, 2020 01 21.
Artículo en Inglés | MEDLINE | ID: mdl-31964922

RESUMEN

Mass spectrometry (MS) is frequently used for proteomic and metabolomic profiling of biological samples. Data obtained by MS are often zero-inflated. Those zero values are called point mass values (PMVs). Zero values can be further grouped into biological PMVs and technical PMVs. The former type is caused by true absence of a compound and the later type is caused by a technical detection limit. Methods based on a mixture model have been developed to separate the two types of zeros and to perform differential abundance analysis comparing proteomic/metabolomic profiles between different groups of subjects. However, we notice that those methods may give unstable estimate of the model variance, and thus lead to false positive and false negative results when the number of non-zero values is small. In this paper, we propose a new differential abundance analysis method, DASEV, which uses an empirical Bayes shrinkage method to more robustly estimate the variance and enhance the accuracy of differential abundance analysis. Simulation studies and real data analysis show that DASEV substantially improves parameter estimation of the mixture model and outperforms current methods in identifying differentially abundant features.


Asunto(s)
Espectrometría de Masas/estadística & datos numéricos , Modelos Estadísticos , Análisis de Varianza , Teorema de Bayes , Carcinoma de Pulmón de Células no Pequeñas/metabolismo , Carcinoma de Pulmón de Células no Pequeñas/patología , Bases de Datos Factuales , Exosomas , Humanos , Metabolismo de los Lípidos , Neoplasias Pulmonares/metabolismo , Neoplasias Pulmonares/patología , Metabolómica/estadística & datos numéricos , Proteinuria/metabolismo , Proteoma/metabolismo , Proteómica/estadística & datos numéricos
17.
Drug Test Anal ; 12(6): 836-845, 2020 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-31997574

RESUMEN

Liquid chromatography coupled to high-resolution mass spectrometry (HRMS) enables data independent acquisition (DIA) and untargeted screening. However, to avoid the handling of the resulting large dataset, most laboratories in that field still use targeted screening methods, which offer good sensitivity and specificity but are limited to known compounds. The promising field of machine learning offers new possibilities such as artificial neural networks that can be trained to classify large amounts of data. In this proof of concept study, we exemplify such a machine learning approach for raw HRMS-DIA data files. We evaluated a machine learning model using training, validation, and test sets of solvent and whole blood samples containing drugs (of abuse) common in forensic toxicology. For that purpose, different platforms were used. With a feedforward neural network model architecture, a category prediction (blank sample vs. drug containing sample) was aimed for. With the applied machine learning approaches, the sensitivity and specificity, of the validation and test set, for the prediction of sample classes were in a suitable range for an actual use in a (routine) laboratory (e.g. workplace drug testing). In conclusion, this proof of concept study clearly demonstrated the huge potential of machine learning in the analysis of HRMS-DIA data.


Asunto(s)
Macrodatos , Aprendizaje Automático , Espectrometría de Masas/estadística & datos numéricos , Redes Neurales de la Computación , Cromatografía Liquida , Cocaína/sangre , Humanos , Prueba de Estudio Conceptual , Reproducibilidad de los Resultados , Sensibilidad y Especificidad , Bibliotecas de Moléculas Pequeñas , Detección de Abuso de Sustancias/métodos , Zolpidem/sangre
18.
Methods Mol Biol ; 2104: 419-445, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-31953829

RESUMEN

Rapid advancements in metabolomics technologies have allowed for application of liquid chromatography mass spectrometry (LCMS)-based metabolomics to investigate a wide range of biological questions. In addition to an important role in studies of cellular biochemistry and biomarker discovery, an exciting application of metabolomics is the elucidation of mechanisms of drug action (Creek et al., Antimicrob Agents Chemother 60:6650-6663, 2016; Allman et al., Antimicrob Agents Chemother 60:6635-6649, 2016). Although it is a very useful technique, challenges in raw data processing, extracting useful information out of large noisy datasets, and identifying metabolites with confidence, have meant that metabolomics is still perceived as a highly specialized technology. As a result, metabolomics has not yet achieved the anticipated extent of uptake in laboratories around the world as genomics or transcriptomics. With a view to bring metabolomics within reach of a nonspecialist scientist, here we describe a routine workflow with IDEOM, which is a graphical user interface within Microsoft Excel, which almost all researchers are familiar with. IDEOM consists of custom built algorithms that allow LCMS data processing, automatic noise filtering and identification of metabolite features (Creek et al., Bioinformatics 28:1048-1049, 2012). Its automated interface incorporates advanced LCMS data processing tools, mzMatch and XCMS, and requires R for complete functionality. IDEOM is freely available for all researchers and this chapter will focus on describing the IDEOM workflow for the nonspecialist researcher in the context of studies designed to elucidate mechanisms of drug action.


Asunto(s)
Cromatografía Liquida , Biología Computacional/métodos , Espectrometría de Masas , Metabolómica , Farmacología , Programas Informáticos , Flujo de Trabajo , Cromatografía Liquida/estadística & datos numéricos , Análisis de Datos , Espectrometría de Masas/estadística & datos numéricos , Redes y Vías Metabólicas , Metabolómica/estadística & datos numéricos , Farmacología/estadística & datos numéricos
19.
J Proteome Res ; 19(1): 477-492, 2020 01 03.
Artículo en Inglés | MEDLINE | ID: mdl-31664839

RESUMEN

Targeted analysis of sequential window acquisition of all theoretical mass spectra (SWATH-MS) requires the spectral library, which can be generated by shotgun mass spectrometry (MS) or by the pseudo-spectra files directly obtained from SWATH-MS data. The external library generated by shotgun MS is employed in most SWATH-MS research. However, performance of the internal library, which is constructed by pseudo-spectra files, in the targeted analysis of SWATH-MS has not been systemically evaluated. Here, we show that up to 40% of the peptides detected by the internal library were not overlapped with those detected by the external library for most SWATH-MS data sets. However, the internal library did not identify extra phosphopeptides compared with the external library for phosphoproteomic SWATH-MS data. Therefore, the internal library should be incorporated into the external library for targeted analysis of nonphosphoproteomic SWATH-MS, given that it can significantly increase the number of peptides of SWATH-MS without requiring additional instrument measurement time.


Asunto(s)
Espectrometría de Masas/métodos , Péptidos/análisis , Proteómica/métodos , Animales , Proteínas Sanguíneas/análisis , Línea Celular , Células HeLa , Humanos , Espectrometría de Masas/estadística & datos numéricos , Ratones , Biblioteca de Péptidos , Fosfoproteínas/análisis , Proteómica/estadística & datos numéricos , Flujo de Trabajo
20.
Anal Chem ; 91(22): 14489-14497, 2019 11 19.
Artículo en Inglés | MEDLINE | ID: mdl-31660729

RESUMEN

Authentication of Cannabis products is important for assuring the quality of manufacturing, with the increasing consumption and regulation. In this report, a two-stage pipeline was developed for high-throughput screening and chemotyping the spectra from two sets of botanical extracts from the Cannabis genus. The first set contains different marijuana samples with higher concentrations of tetrahydrocannabinol (THC). The other set includes samples from hemp, a variety of Cannabis sativa with the THC concentration below 0.3%. The first stage applies the technique of class modeling to determine whether spectra belong to marijuana or hemp and reject novel spectra that may be neither marijuana nor hemp. An automatic soft independent modeling of class analogy (aSIMCA) that self-optimizes the number of principal components and the decision threshold is utilized in the first pipeline process to achieve excellent efficiency and efficacy. Once these spectra are recognized by aSIMCA as marijuana or hemp, they are then routed to the appropriate classifiers in the second stage for chemotyping the spectra, i.e., identifying these spectra into different chemotypes so that the pharmacological properties and cultivars of the spectra can be recognized. Three multivariate classifiers, a fuzzy rule building expert system (FuRES), super partial least-squares-discriminant analysis (sPLS-DA), and support vector machine tree type entropy (SVMtreeH), are employed for chemotyping. The discriminant ability of the pipeline was evaluated with different spectral data sets of these two groups of botanical samples, including proton nuclear magnetic resonance, mass, and ultraviolet spectra. All evaluations gave good results with accuracies greater than 95%, which demonstrated promising application of the pipeline for automated high-throughput screening and chemotyping marijuana and hemp, as well as other botanical products.


Asunto(s)
Cannabis/química , Cannabis/clasificación , Ensayos Analíticos de Alto Rendimiento/métodos , Extractos Vegetales/análisis , Análisis Discriminante , Lógica Difusa , Ensayos Analíticos de Alto Rendimiento/estadística & datos numéricos , Análisis de los Mínimos Cuadrados , Espectrometría de Masas/estadística & datos numéricos , Modelos Químicos , Espectroscopía de Protones por Resonancia Magnética/estadística & datos numéricos , Máquina de Vectores de Soporte
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...