RESUMO
MOTIVATION: The analysis of biological samples in untargeted metabolomic studies using LC-MS yields tens of thousands of ion signals. Annotating these features is of the utmost importance for answering questions as fundamental as, e.g. how many metabolites are there in a given sample. RESULTS: Here, we introduce CliqueMS, a new algorithm for annotating in-source LC-MS1 data. CliqueMS is based on the similarity between coelution profiles and therefore, as opposed to most methods, allows for the annotation of a single spectrum. Furthermore, CliqueMS improves upon the state of the art in several dimensions: (i) it uses a more discriminatory feature similarity metric; (ii) it treats the similarities between features in a transparent way by means of a simple generative model; (iii) it uses a well-grounded maximum likelihood inference approach to group features; (iv) it uses empirical adduct frequencies to identify the parental mass and (v) it deals more flexibly with the identification of the parental mass by proposing and ranking alternative annotations. We validate our approach with simple mixtures of standards and with real complex biological samples. CliqueMS reduces the thousands of features typically obtained in complex samples to hundreds of metabolites, and it is able to correctly annotate more metabolites and adducts from a single spectrum than available tools. AVAILABILITY AND IMPLEMENTATION: https://CRAN.R-project.org/package=cliqueMS and https://github.com/osenan/cliqueMS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Software , Espectrometria de Massas em Tandem , Cromatografia Líquida , Íons , Metabolômica , Redes Neurais de ComputaçãoRESUMO
Thrombus formation is a multiscale phenomenon triggered by platelet deposition over a protrombotic surface (eg. a ruptured atherosclerotic plaque). Despite the medical urgency for computational tools that aid in the early diagnosis of thrombotic events, the integration of computational models of thrombus formation at different scales requires a comprehensive understanding of the role and limitation of each modelling approach. We propose three different modelling approaches to predict platelet deposition. Specifically, we consider measurements of platelet deposition under blood flow conditions in a perfusion chamber for different time periods (3, 5, 10, 20 and 30 minutes) at shear rates of 212 s(-1), 1390 s(-1) and 1690 s(-1). Our modelling approaches are: i) a model based on the mass-transfer boundary layer theory; ii) a machine-learning approach; and iii) a phenomenological model. The results indicate that the three approaches on average have median errors of 21%, 20.7% and 14.2%, respectively. Our study demonstrates the feasibility of using an empirical data set as a proxy for a real-patient scenario in which practitioners have accumulated data on a given number of patients and want to obtain a diagnosis for a new patient about whom they only have the current observation of a certain number of variables.