RESUMO
The aging process is associated with physiological, sensory, psychological, and sociological changes likely to have an impact on food intake and the nutritional status. The present study aimed to explore the heterogeneity of the French older population (>65 years old) using a multidisciplinary approach. More specifically, the study aimed to highlight different typologies (i.e. clusters of individuals with similar characteristics) within the older population. We conducted face-to-face interviews and tests with 559 French older people, recruited from different categories of dependency (at home without help, at home with help, in nursing homes). Clustering analysis highlighted seven clusters. Clusters 1-3 contained 'young' older people (<80) with a good nutritional status; these clusters differed according to food preferences, the desire to have a healthy diet, or interest in food. Clusters 4-7 mainly contained 'old' older people (80+), with an increase in the nutritional risk from cluster 4 to cluster 7. Two of these clusters grouped healthy and active people with a good level of appetite, while the two other clusters were associated with a clear decline in nutritional status, with people suffering from eating difficulties or depression. The results raise the need to develop targeted interventions to tackle malnutrition and implement health promotion strategies among the seniors.
Assuntos
Envelhecimento Saudável , Desnutrição , Idoso , Envelhecimento , Nível de Saúde , Humanos , Estilo de Vida , Estado Nutricional , PercepçãoRESUMO
Collision cross section (CCS) databases based on single-laboratory measurements must be cross-validated to extend their use in peak annotation. This work addresses the validation of the first comprehensive TWCCSN2 database for steroids. First, its long-term robustness was evaluated (i.e., a year and a half after database generation; Synapt G2-S instrument; bias within ±1.0% for 157 ions, 95.7% of the total ions). It was further cross-validated by three external laboratories, including two different TWIMS platforms (i.e., Synapt G2-Si and two Vion IMS QToF; bias within the threshold of ±2.0% for 98.8, 79.9, and 94.0% of the total ions detected by each instrument, respectively). Finally, a cross-laboratory TWCCSN2 database was built for 87 steroids (142 ions). The cross-laboratory database consists of average TWCCSN2 values obtained by the four TWIMS instruments in triplicate measurements. In general, lower deviations were observed between TWCCSN2 measurements and reference values when the cross-laboratory database was applied as a reference instead of the single-laboratory database. Relative standard deviations below 1.5% were observed for interlaboratory measurements (<1.0% for 85.2% of ions) and bias between average values and TWCCSN2 measurements was within the range of ±1.5% for 96.8% of all cases. In the context of this interlaboratory study, this threshold was also suitable for TWCCSN2 measurements of steroid metabolites in calf urine. Greater deviations were observed for steroid sulfates in complex urine samples of adult bovines, showing a slight matrix effect. The implementation of a scoring system for the application of the CCS descriptor in peak annotation is also discussed.
Assuntos
Esteroides/urina , Animais , Bovinos , Bases de Dados Factuais , Espectrometria de Mobilidade Iônica , Esteroides/metabolismoRESUMO
Modern environmental epidemiology benefits from a new generation of technologies that enable comprehensive profiling of biomarkers, including environmental chemical exposure and omic datasets. The integration and analysis of large and structured datasets to identify functional associations is constrained by computational challenges that cannot be overcome using conventional regression methods. Some extensions of Partial Least Squares (PLS) regression have been developed to efficently integrate multiple datasets, including Multiblock PLS (MB-PLS) and Sequential and Orthogonalized PLS; however, these approaches remain seldom applied in environmental epidemiology. To address that research gap, this study aimed to assess and compare the applicability of PLS-based multiblock models in an observational case study, where biomarkers of exposure to environmental chemicals and endogenous biomarkers of effect were simultaneously integrated to highlight biological links related to a health outcome. The methods were compared with and without sparsity coupling two metrics to support the variable selection: Variable Importance in Projection (VIP) and Selectivity Ratio (SR). The framework was applied to a case-study dataset mimicking the structure of 36 environmental exposure biomarkers (E-block), 61 inflammation biomarkers (M-block), and their relationships with the gestational age at delivery of 161 mother-infant pairs. The results showed an overall consistency in the selected variables across models, although some specific selection patterns were identified. The block-scaled concatenation-based approaches (e.g. MB-PLS) tended to select more variables from the E-block, while these methods were unable to identify certain variables in the M-block. Overall, the number of variables selected using the SR criterion was higher than using the VIP criterion, with lower predictive performances. The multiblock models coupled to VIP, appeared to be the methods of choice for identifying relevant variables with similar statistical performances. Overall, the use of multiblock PLS-based methods appears to be a good strategy to efficiently support the variable selection process in modern environmental epidemiology.
Assuntos
Biomarcadores , Exposição Ambiental , Biomarcadores/análise , Humanos , Exposição Ambiental/estatística & dados numéricos , Análise dos Mínimos Quadrados , Saúde Ambiental , Poluentes Ambientais/análise , FemininoRESUMO
BACKGROUND: The most near-term clinical application of genome-wide association studies in lung cancer is a polygenic risk score (PRS). METHODS: A case-control dataset was generated consisting of 4002 lung cancer cases from the LORD project and 20,010 ethnically matched controls from CARTaGENE. A genome-wide PRS including >1.1 million genetic variants was derived and validated in UK Biobank (n = 5419 lung cancer cases). The predictive ability and diagnostic discrimination performance of the PRS was tested in LORD/CARTaGENE and benchmarked against previous PRSs from the literature. Stratified analyses were performed by smoking status and genetic risk groups defined as low (<20th percentile), intermediate (20-80th percentile) and high (>80th percentile) PRS. FINDINGS: The phenotypic variance explained and the effect size of the genome-wide PRS numerically outperformed previous PRSs. Individuals with high genetic risk had a 2-fold odds of lung cancer compared to low genetic risk. The PRS was an independent predictor of lung cancer beyond conventional clinical risk factors, but its diagnostic discrimination performance was incremental in an integrated risk model. Smoking increased the odds of lung cancer by 7.7-fold in low genetic risk and by 11.3-fold in high genetic risk. Smoking with high genetic risk was associated with a 17-fold increase in the odds of lung cancer compared to individuals who never smoked and with low genetic risk. INTERPRETATION: Individuals at low genetic risk are not protected against the smoking-related risk of lung cancer. The joint multiplicative effect of PRS and smoking increases the odds of lung cancer by nearly 20-fold. FUNDING: This work was supported by the CQDM and the IUCPQ Foundation owing to a generous donation from Mr. Normand Lord.
Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Neoplasias Pulmonares , Herança Multifatorial , Fumar , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/epidemiologia , Neoplasias Pulmonares/etiologia , Estudos de Casos e Controles , Fumar/efeitos adversos , Fumar/epidemiologia , Feminino , Masculino , Pessoa de Meia-Idade , Idoso , Fatores de Risco , Canadá/epidemiologia , Polimorfismo de Nucleotídeo Único , França/epidemiologiaRESUMO
Studying a large number of variables measured on the same observations and organized in blocks - denoted multiblock data - is becoming standard in several domains especially in biology. To explore the relationships between all these variables - at the block- and the variable-level - several exploratory multiblock methods were proposed. However, most of them are only designed for numeric variables. In reality, some data sets contain variables of different measurement levels (i.e., numeric, nominal, ordinal). In this article, we focus on exploratory multiblock methods that handle variables at their appropriate measurement level. Multi-Block Principal Component Analysis with Optimal Scaling (MBPCA-OS) is proposed and applied to multiblock data from the CURIE-O-SA French cohort. In this study, variables are of different measurement levels and organized in four blocks. The objective is to study the immune responses according to the SARS-CoV-2 infection and vaccination statuses, the symptoms and the participant's characteristics.
RESUMO
Humans are exposed to a growing list of synthetic chemicals, some of them becoming a major public health concern due to their capacity to impact multiple biological endpoints and contribute to a range of chronic diseases. The integration of endogenous (omic) biomarkers of effect in environmental health studies has been growing during the last decade, aiming to gain insight into potential mechanisms linking the exposures and the clinical conditions. The emergence of high-throughput omic platforms has raised a list of statistical challenges posed by the large dimension and complexity of data generated. Thus, the aim of the present study was to critically review the current state-of-the-science about statistical approaches used to integrate endogenous biomarkers in environmental-health studies linking chemical exposures with health outcomes. The present review specifically focused on internal exposure to environmental chemical pollutants, involving both persistent organic pollutants (POPs) and non-persistent pollutants like phthalates or bisphenols, and metals. We identified 42 eligible articles published since 2016, reporting 48 different statistical workflows, mostly focused on POPs and using metabolomic profiling in the intermediate layer. The outcomes were mainly binary and focused on metabolic disorders. A large diversity of statistical strategies were reported to integrate chemical mixtures and endogenous biomarkers to characterize their associations with health conditions. Multivariate regression models were the most predominant statistical method reported in the published workflows, however some studies applied latent based methods or multipollutant models to overcome the specific constraints of omic or exposure data. A minority of studies used formal mediation analysis to characterize the indirect effects mediated by the endogenous biomarkers. The principles of each specific statistical method and overall workflow set-up are summarized in the light of highlighting their applicability, strengths and weaknesses or interpretability to gain insight into the causal structures underlying the triad: exposure, effect-biomarker and outcome.
Assuntos
Poluentes Ambientais , Humanos , Poluentes Ambientais/análise , Saúde Ambiental , Poluentes Orgânicos Persistentes , Biomarcadores , Exposição Ambiental/análiseRESUMO
In nutrition and health research, untargeted metabolomics is actually analyzed simultaneously with clinical data to improve prediction and better understand pathological status. This can be modeled using a multiblock supervised model with several input data blocks (metabolomics, clinical data) being potential predictors of the outcome to be explained. Alternatively, this configuration can be represented with a path diagram where the input blocks are each connected by links directed to the outcome-as in multiblock supervised modeling-and are also related to each other, thus allowing one to account for block effects. On the basis of a path model, we show herein how to estimate the effect of an input block, either on its own or conditionally to other(s), on the output response, respectively called "global" and "partial" effects, by percentages of explained variance in dedicated PLS regression models. These effects have been computed in two different path diagrams in a case study relative to metabolic syndrome, involving metabolomics and clinical data from an older men's cohort (NuAge). From the two effects associated with each path, the results highlighted the complementary information provided by metabolomics to clinical data and, reciprocally, in the metabolic syndrome exploration.
RESUMO
Brochothrix thermosphacta is considered as a major spoiler of meat and seafood products. This study explores the biofilm formation ability and the biofilm structural diversity of 30 multi-origin B. thermosphacta strains using a set of complementary biofilm assays (biofilm ring test, crystal violet staining, and confocal laser scanning microscopy). Two major groups corresponding to low and high biofilm producers were identified. High biofilm producers presented flat architectures characterized by high surface coverage, high cell biovolume, and high surface area.
RESUMO
Humans are exposed daily to complex mixtures of chemical pollutants through their environment and diet, some of which have the potential to disrupt the bodies' natural endocrine functions and contribute to reproductive diseases like endometriosis. Increasing epidemiological and experimental evidence supports the association between endometriosis and certain persistent organic pollutants (POPs) like dioxins; however, little is known about the underlying linking mechanisms. The main objective of this study is to proof the methodological applicability and discovery potential of integrating ultra-trace mass spectrometry (MS) profiling of POP biomarkers and endogenous biomarker profiling (MS metabolomics and cytokines) in a case-control study for the etiological research of endometriosis. The approach is applied in a pilot clinical-based study conducted in France where women with and without surgically confirmed endometriosis were recruited. Serum samples were analysed with high-resolution MS for about 30 polychlorinated biphenyls (PCBs), organochlorinated pesticides and perfluoroalkyl substances (PFAS). About 600 serum metabolites and lipids were identified with targeted metabolomics using tandem MS with the Biocrates MxP® Quant 500 Kit. A panel of 4 pro-inflammatory cytokines were analysed using ELISA-based 4-PLEX analyser. Statistical analysis included a battery of variable selection approaches, multivariate logistic regression for single-chemical associations, Bayesian kernel machine regressions (BKMR) to identify mixture effects of POPs and a multiblock approach to identify shared biomarker signatures among high risk clusters. The results showed the positive associations between some POPs and endometriosis risk, including the pesticide trans-nonachlor Odds Ratio (95% Confidence Interval) 3.38 (2.06-5.98), p < 0.0001 and PCB 114 OR (95% CI) 1.83 (1.17-2.93), p = 0.009. The BKMR approach showed a tendency of a positive cumulative effect of the mixture, however trans-nonachlor exhibited significant associations within the mixture and interacted with other PCBs, strengthening the effects at highest concentrations. Finally, the multiblock analysis, relating the various blocks of data, revealed a latent cluster of women with higher risk of endometrioma presenting higher concentrations of trans-nonachlor, PCB 114 and dioxin-like toxic equivalents from PCBs, together with an increased inflammatory profile (i.e. elevated interleukin-8 and monocyte chemoattractant protein-1). It was also highlighted a specific metabolic pattern characterized by dysregulation of bile acid homeostasis and lipase activity. Further research will be required with larger sample size to confirm these findings and gain insight on the underlying mechanisms between POPs and endometriosis.
Assuntos
Endometriose , Poluentes Ambientais , Bifenilos Policlorados , Teorema de Bayes , Estudos de Casos e Controles , Citocinas , Endometriose/induzido quimicamente , Poluentes Ambientais/análise , Poluentes Ambientais/toxicidade , Feminino , Humanos , Poluentes Orgânicos PersistentesRESUMO
Lightly preserved seafood products, such as cold-smoked fish and fish gravlax, are traditionally consumed in Europe and are of considerable economic importance. This work aimed to compare three products that were obtained from the same batch of fish: cold-smoked salmon (CSS) stored under vacuum packaging (VP) or a modified atmosphere packaging (MAP) and VP salmon dill gravlax (SG). Classical microbiological analyses and 16S rRNA metabarcoding, biochemical analyses (trimethylamine, total volatile basic nitrogen (TVBN), biogenic amines, pH, volatile organic compounds (VOCs)) and sensory analyses (quantitative descriptive analysis) were performed on each product throughout their storage at a chilled temperature. The three products shared the same initial microbiota, which were mainly dominated by Photobacterium, Lactococcus and Lactobacillus genera. On day 28, the VP CSS ecosystem was mainly composed of Photobacterium and, to a lesser extent, Lactococcus and Lactobacillus genera, while Lactobacillus was dominant in the MAP CSS. The diversity was higher in the SG, which was mainly dominated by Enterobacteriaceae, Photobacterium, Lactobacillus and Lactococcus. Although the sensory spoilage was generally weak, gravlax was the most perishable product (slight increase in amine and acidic off-odors and flavors, fatty appearance, slight discoloration and drop in firmness), followed by the VP CSS, while the MAP CSS did not spoil. Spoilage was associated with an increase in the TVBN, biogenic amines and spoilage associated VOCs, such as decanal, nonanal, hexadecanal, benzaldehyde, benzeneacetaldehyde, ethanol, 3-methyl-1-butanol, 2,3-butanediol, 1-octen-3-ol, 2-butanone and 1-octen-3-one. This study showed that the processing and packaging conditions both had an effect on the microbial composition and the quality of the final product.
RESUMO
Endometriosis is a gynaecological disease characterised by the presence of endometriotic tissue outside of the uterus impacting a significant fraction of women of childbearing age. Evidence from epidemiological studies suggests a relationship between risk of endometriosis and exposure to some organochlorine persistent organic pollutants (POPs). However, these chemicals are numerous and occur in complex and highly correlated mixtures, and to date, most studies have not accounted for this simultaneous exposure. Linear and logistic regression models are constrained to adjusting for multiple exposures when variables are highly intercorrelated, resulting in unstable coefficients and arbitrary findings. Advanced machine learning models, of emerging use in epidemiology, today appear as a promising option to address these limitations. In this study, different machine learning techniques were compared on a dataset from a case-control study conducted in France to explore associations between mixtures of POPs and deep endometriosis. The battery of models encompassed regularised logistic regression, artificial neural network, support vector machine, adaptive boosting, and partial least-squares discriminant analysis with some additional sparsity constraints. These techniques were applied to identify the biomarkers of internal exposure in adipose tissue most associated with endometriosis and to compare model classification performance. The five tested models revealed a consistent selection of most associated POPs with deep endometriosis, including octachlorodibenzofuran, cis-heptachlor epoxide, polychlorinated biphenyl 77 or trans-nonachlor, among others. The high classification performance of all five models confirmed that machine learning may be a promising complementary approach in modelling highly correlated exposure biomarkers and their associations with health outcomes. Regularised logistic regression provided a good compromise between the interpretability of traditional statistical approaches and the classification capacity of machine learning approaches. Applying a battery of complementary algorithms may be a strategic approach to decipher complex exposome-health associations when the underlying structure is unknown.
Assuntos
Algoritmos , Endometriose/epidemiologia , Exposição Ambiental/estatística & dados numéricos , Poluentes Ambientais , Estudos de Casos e Controles , Feminino , França , Humanos , Aprendizado de MáquinaRESUMO
This paper describes data collected on 2 sets of 8 French red wines from two grape varieties: Pinot Noir (PN) and Cabernet Franc (CF). It provides, for the 16 wines, (i) sensory descriptive data obtained with a trained panel, (ii) volatile organic compounds (VOC) quantification data obtained by Headspace Solid Phase Micro-Extraction - Gas Chromatography - Mass Spectrometry (HS-SPME-GC-MS) and (iii) odor-active compounds identification by Headspace Solid Phase Micro-Extraction - Gas Chromatography - Mass Spectrometry - Olfactometry (HS-SPME-GC-MS-O). The raw data are hosted on an open-access research data repository [1].
RESUMO
OBJECTIVE: To estimate whether the "Diagnostic and Statistical Manual of Mental Disorders" (DSM) is biologically accurate for the diagnosis of Attention Deficit/ Hyperactivity Disorder (ADHD) using a biological-based classifier built by a special method of multivariate analysis of a large dataset of a small sample (much more variables than subjects), holding neurophysiological, behavioral, and psychological variables. METHODS: Twenty typically developing boys and 19 boys diagnosed with ADHD, aged 10-13 years, were examined using the Attentional Network Test (ANT) with recordings of event-related potentials (ERPs). From 774 variables, a reduced number of latent variables (LVs) were extracted with a clustering of variables method (CLV), for further reclassification of subjects using the k-means method. This approach allowed a multivariate analysis to be applied to a significantly larger number of variables than the number of cases. RESULTS: From datasets including ERPs from the mid-frontal, mid-parietal, right frontal, and central scalp areas, we found 82% of agreement between DSM and biological-based classifications. The kappa index between DSM and behavioral/psychological/neurophysiological data was 0.75, which is regarded as a "substantial level of agreement". DISCUSSION: The CLV is a useful method for multivariate analysis of datasets with much less subjects than variables. In this study, a correlation is found between the biological-based classifier and the DSM outputs for the classification of subjects as either ADHD or not. This result suggests that DSM clinically describes a biological condition, supporting its validity for ADHD diagnostics.
RESUMO
Varying the crushing parameters in a model mouth apparatus gave different crushed apple samples, which were compared to apples crushed in the human mouth by six people. An image analysis method was developed to measure the similarity between apple particles after crushing in the artificial mouth and in the human mouth. Thus, experimental conditions were determined that produced fruit in a state closest to that obtained after mastication in a human mouth. The influence of these different conditions on the quantity of released volatile compounds was then studied.
Assuntos
Frutas/química , Malus/química , Mastigação , Boca , Humanos , Técnicas In Vitro , Reação de Maillard , Modelos Biológicos , Odorantes/análise , VolatilizaçãoRESUMO
The classification of microorganisms by high-dimensional phenotyping methods such as FTIR spectroscopy is often a complicated process due to the complexity of microbial phylogenetic taxonomy. A hierarchical structure developed for such data can often facilitate the classification analysis. The hierarchical tree structure can either be imposed to a given set of phenotypic data by integrating the phylogenetic taxonomic structure or set up by revealing the inherent clusters in the phenotypic data. In this study, we wanted to compare different approaches to hierarchical classification of microorganisms based on high-dimensional phenotypic data. A set of 19 different species of molds (filamentous fungi) obtained from the mycological strain collection of the Norwegian Veterinary Institute (Oslo, Norway) is used for the study. Hierarchical cluster analysis is performed for setting up the classification trees. Classification algorithms such as artificial neural networks (ANN), partial least-squared discriminant analysis and random forest (RF) are used and compared. The 2 methods ANN and RF outperformed all the other approaches even though they did not utilize predefined hierarchical structure. To our knowledge, the RF approach is used here for the first time to classify microorganisms by FTIR spectroscopy.
Assuntos
Classificação/métodos , Fungos/classificação , Fenótipo , Análise Discriminante , Análise dos Mínimos Quadrados , Redes Neurais de Computação , FilogeniaRESUMO
Increasing activity along the French Atlantic coast has led to chronic pollution with, in particular, mixtures of contaminants such as hydrocarbons, phytosanitary products, PCBs and heavy metals. Based on previous research, pollution biomarkers were used in this study as they can indicate health status when monitoring the impact of pollutants on coastal species such as the marine bivalve Mimachlamys varia. Mollusc bivalves were sampled in March 2016, in open and semi-open areas (a harbour zone), from thirteen sites which differed in terms of their level of pollution, and were located along the Atlantic coast from Brittany down to the Nouvelle-Aquitaine region. First, analyses of heavy metals and organic contaminants (e.g. pesticides, polycyclic aromatic hydrocarbons, polychlorobiphenyl) in the digestive gland of bivalves were performed. Second, biochemical assays were used to study defence biomarkers: oxidative stress with Superoxide Dismutase (SOD), detoxification of organic compounds with Glutathione-S Transferase (GST), lipid peroxidation with Malondialdehyde (MDA), and immune processes with Laccase. In addition to the biochemical assays, a genetic approach was used to measure genetic diversity (haplotype and nucleotide diversity) at each site. Biomarker assays and genetic diversity were correlated with the chemical contaminants in bivalves using the Path-ComDim statistical model. Our results showed specific correlations between biochemical assays in the digestive glands with heavy metal contaminants, and between genetic diversity and organic pollution. Blocks of responses were analysed for correlations in order to develop standardized tools and guidelines that could improve our understanding of the short-term and long-term impact of contaminants on physiological parameters.