Búsqueda | Portal de Búsqueda de la BVS Ecuador

1.

Markers of platelet activation foR identification of late onset sEpsis in infaNTs: PARENT study protocol.

O'Reilly, Daniel; Murphy, Claire Anne; Moore, Carmel Maria; Ní Áinle, Fionnuala; Gormley, Isobel Claire; Morrell, Craig N; Curley, Anna; Mc Callion, Naomi; Maguire, Patricia.

Pediatr Res ; 95(3): 852-856, 2024 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-37758864

RESUMEN

BACKGROUND: Newborns are at high risk of sepsis. At present there is no definitive "rule in" blood test for sepsis at the point of clinical concern. A positive blood culture remains the gold standard test for neonatal sepsis, however laboratory markers that correlate prospectively with culture positive sepsis could aid clinicians in making decisions regarding administration of empiric antibiotic therapies. METHODS: This multi-site, prospective observational study will take place in two neonatal intensive care units (National Maternity Hospital and Rotunda Hospital, Dublin). Neonates born at less than 34 weeks will be enroled and informed consent obtained prior to late onset sepsis work up. If at any point subsequently during their neonatal intensive care stay they develop signs and symptoms of possible sepsis requiring blood culture, an additional sodium citrate sample will be obtained. Infants will be categorised into three groups as follows: (i) culture positive sepsis, (ii) culture negative sepsis where an infant receives 5 days of antibiotics (iii) non sepsis. Our primary outcome is to establish if differential platelet/endothelial activation can prospectively identify neonatal culture positive late onset sepsis. TRIAL REGISTRATION NUMBER: NCT05530330 IMPACT: Preterm infants are a high risk group for the development of sepsis which is a major cause of mortality in this population. Platelets have been associated with host response to invasive bacterial infections both in animal models and translational work. A positive blood culture is the gold standard test for neonatal sepsis but can be unreliable due to limited blood sampling in the very low birth weight population. This study hopes to establish if platelet/endothelial associated plasma proteins can prospectively identify late onset neonatal sepsis.

Asunto(s)

Infecciones Bacterianas , Sepsis Neonatal , Sepsis , Femenino , Humanos , Lactante , Recién Nacido , Embarazo , Antibacterianos/uso terapéutico , Recien Nacido Prematuro , Unidades de Cuidado Intensivo Neonatal , Sepsis Neonatal/diagnóstico , Estudios Observacionales como Asunto , Activación Plaquetaria , Sepsis/epidemiología , Estudios Prospectivos , Estudios Multicéntricos como Asunto

2.

multiMarker: software for modelling and prediction of continuous food intake using multiple biomarkers measurements.

D'Angelo, Silvia; Gormley, Isobel Claire; McNamara, Aoife E; Brennan, Lorraine.

BMC Bioinformatics ; 22(1): 469, 2021 Sep 28.

Artículo en Inglés | MEDLINE | ID: mdl-34583648

RESUMEN

BACKGROUND: Metabolomic biomarkers offer potential for objective and reliable food intake assessment, and there is growing interest in using biomarkers in place of or with traditional self-reported approaches. Ongoing research suggests that multiple biomarkers are associated with single foods, offering great sensitivity and specificity. However, currently there is a dearth of methods to model the relationship between multiple biomarkers and single food intake measurements. RESULTS: Here, we introduce multiMarker, a web-based application based on the homonymous R package, that enables one to infer the relationship between food intake and two or more metabolomic biomarkers. Furthermore, multiMarker allows prediction of food intake from biomarker data alone. multiMarker differs from previous approaches by providing distributions of predicted intakes, directly accounting for uncertainty in food intake quantification. Usage of both the R package and the web application is demonstrated using real data concerning three biomarkers for orange intake. Further, example data is pre-loaded in the web application to enable users to examine multiMarker's functionality. CONCLUSION: The proposed software advance the field of Food Intake Biomarkers providing researchers with a novel tool to perform continuous food intake quantification, and to assess its associated uncertainty, from multiple biomarkers. To facilitate widespread use of the framework, multiMarker has been implemented as an R package and a Shiny web application.

Asunto(s)

Metabolómica , Programas Informáticos , Biomarcadores , Ingestión de Alimentos , Humanos

3.

Computational modelling of chromosomally clustering protein domains in bacteria.

Cotroneo, Chiara E; Gormley, Isobel Claire; Shields, Denis C; Salter-Townshend, Michael.

BMC Bioinformatics ; 22(1): 593, 2021 Dec 14.

Artículo en Inglés | MEDLINE | ID: mdl-34906073

RESUMEN

BACKGROUND: In bacteria, genes with related functions-such as those involved in the metabolism of the same compound or in infection processes-are often physically close on the genome and form groups called clusters. The enrichment of such clusters over various distantly related bacteria can be used to predict the roles of genes of unknown function that cluster with characterised genes. There is no obvious rule to define a cluster, given their variability in size and intergenic distances, and the definition of what comprises a "gene", since genes can gain and lose domains over time. Protein domains can cluster within a gene, or in adjacent genes of related function, and in both cases these are chromosomally clustered. Here, we model the distances between pairs of protein domain coding regions across a wide range of bacteria and archaea via a probabilistic two component mixture model, without imposing arbitrary thresholds in terms of gene numbers or distances. RESULTS: We trained our model using matched gene ontology terms to label functionally related pairs and assess the stability of the parameters of the model across 14,178 archaeal and bacterial strains. We found that the parameters of our mixture model are remarkably stable across bacteria and archaea, except for endosymbionts and obligate intracellular pathogens. Obligate pathogens have smaller genomes, and although they vary, on average do not show noticeably different clustering distances; the main difference in the parameter estimates is that a far greater proportion of the genes sharing ontology terms are clustered. This may reflect that these genomes are enriched for complexes encoded by clustered core housekeeping genes, as a proportion of the total genes. Given the overall stability of the parameter estimates, we then used the mean parameter estimates across the entire dataset to investigate which gene ontology terms are most frequently associated with clustered genes. CONCLUSIONS: Given the stability of the mixture model across species, it may be used to predict bacterial gene clusters that are shared across multiple species, in addition to giving insights into the evolutionary pressures on the chromosomal locations of genes in different species.

Asunto(s)

Genoma Arqueal , Genoma Bacteriano , Archaea/genética , Bacterias/genética , Análisis por Conglomerados , Simulación por Computador , Evolución Molecular , Filogenia , Dominios Proteicos

4.

Genome-wide association analyses of carcass traits using copy number variants and raw intensity values of single nucleotide polymorphisms in cattle.

Rafter, Pierce; Gormley, Isobel Claire; Purfield, Deirdre; Parnell, Andrew C; Naderi, Saeid; Berry, Donagh P.

BMC Genomics ; 22(1): 757, 2021 Oct 23.

Artículo en Inglés | MEDLINE | ID: mdl-34688258

RESUMEN

BACKGROUND: The carcass value of cattle is a function of carcass weight and quality. Given the economic importance of carcass merit to producers, it is routinely included in beef breeding objectives. A detailed understanding of the genetic variants that contribute to carcass merit is useful to maximize the efficiency of breeding for improved carcass merit. The objectives of the present study were two-fold: firstly, to perform genome-wide association analyses of carcass weight, carcass conformation, and carcass fat using copy number variant (CNV) data in a population of 923 Holstein-Friesian, 945 Charolais, and 974 Limousin bulls; and secondly to perform separate association analyses of carcass traits on the same population of cattle using the Log R ratio (LRR) values of 712,555 single nucleotide polymorphisms (SNPs). The LRR value of a SNP is a measure of the signal intensity of the SNP generated during the genotyping process. RESULTS: A total of 13,969, 3,954, and 2,805 detected CNVs were tested for association with the three carcass traits for the Holstein-Friesian, Charolais, and Limousin, respectively. The copy number of 16 CNVs and the LRR of 34 SNPs were associated with at least one of the three carcass traits in at least one of the three cattle breeds. With the exception of three SNPs, none of the quantitative trait loci detected in the CNV association analyses or the SNP LRR association analyses were also detected using traditional association analyses based on SNP allele counts. Many of the CNVs and SNPs associated with the carcass traits were located near genes related to the structure and function of the spliceosome and the ribosome; in particular, U6 which encodes a spliceosomal subunit and 5S rRNA which encodes a ribosomal subunit. CONCLUSIONS: The present study demonstrates that CNV data and SNP LRR data can be used to detect genomic regions associated with carcass traits in cattle providing information on quantitative trait loci over and above those detected using just SNP allele counts, as is the approach typically employed in genome-wide association analyses.

Asunto(s)

Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Animales , Bovinos/genética , Variaciones en el Número de Copia de ADN , Masculino , Fenotipo , Sitios de Carácter Cuantitativo

5.

Concordance rate between copy number variants detected using either high- or medium-density single nucleotide polymorphism genotype panels and the potential of imputing copy number variants from flanking high density single nucleotide polymorphism haplotypes in cattle.

Rafter, Pierce; Gormley, Isobel Claire; Parnell, Andrew C; Kearney, John Francis; Berry, Donagh P.

BMC Genomics ; 21(1): 205, 2020 Mar 04.

Artículo en Inglés | MEDLINE | ID: mdl-32131735

RESUMEN

BACKGROUND: The trading of individual animal genotype information often involves only the exchange of the called genotypes and not necessarily the additional information required to effectively call structural variants. The main aim here was to determine if it is possible to impute copy number variants (CNVs) using the flanking single nucleotide polymorphism (SNP) haplotype structure in cattle. While this objective was achieved using high-density genotype panels (i.e., 713,162 SNPs), a secondary objective investigated the concordance of CNVs called with this high-density genotype panel compared to CNVs called from a medium-density panel (i.e., 45,677 SNPs in the present study). This is the first study to compare CNVs called from high-density and medium-density SNP genotypes from the same animals. High (and medium-density) genotypes were available on 991 Holstein-Friesian, 1015 Charolais, and 1394 Limousin bulls. The concordance between CNVs called from the medium-density and high-density genotypes were calculated separately for each animal. A subset of CNVs which were called from the high-density genotypes was selected for imputation. Imputation was carried out separately for each breed using a set of high-density SNPs flanking the midpoint of each CNV. A CNV was deemed to be imputed correctly when the called copy number matched the imputed copy number. RESULTS: For 97.0% of CNVs called from the high-density genotypes, the corresponding genomic position on the medium-density of the animal did not contain a called CNV. The average accuracy of imputation for CNV deletions was 0.281, with a standard deviation of 0.286. The average accuracy of imputation of the CNV normal state, i.e. the absence of a CNV, was 0.982 with a standard deviation of 0.022. Two CNV duplications were imputed in the Charolais, a single CNV duplication in the Limousins, and a single CNV duplication in the Holstein-Friesians; in all cases the CNV duplications were incorrectly imputed. CONCLUSION: The vast majority of CNVs called from the high-density genotypes were not detected using the medium-density genotypes. Furthermore, CNVs cannot be accurately predicted from flanking SNP haplotypes, at least based on the imputation algorithms routinely used in cattle, and using the SNPs currently available on the high-density genotype panel.

Asunto(s)

Biología Computacional/métodos , Variaciones en el Número de Copia de ADN , Polimorfismo de Nucleótido Simple , Algoritmos , Alelos , Animales , Bovinos , Frecuencia de los Genes , Genotipo , Haplotipos

6.

BINDER: computationally inferring a gene regulatory network for Mycobacterium abscessus.

Staunton, Patrick M; Miranda-CasoLuengo, Aleksandra A; Loftus, Brendan J; Gormley, Isobel Claire.

BMC Bioinformatics ; 20(1): 466, 2019 Sep 10.

Artículo en Inglés | MEDLINE | ID: mdl-31500560

RESUMEN

BACKGROUND: Although many of the genic features in Mycobacterium abscessus have been fully validated, a comprehensive understanding of the regulatory elements remains lacking. Moreover, there is little understanding of how the organism regulates its transcriptomic profile, enabling cells to survive in hostile environments. Here, to computationally infer the gene regulatory network for Mycobacterium abscessus we propose a novel statistical computational modelling approach: BayesIan gene regulatory Networks inferreD via gene coExpression and compaRative genomics (BINDER). In tandem with derived experimental coexpression data, the property of genomic conservation is exploited to probabilistically infer a gene regulatory network in Mycobacterium abscessus.Inference on regulatory interactions is conducted by combining 'primary' and 'auxiliary' data strata. The data forming the primary and auxiliary strata are derived from RNA-seq experiments and sequence information in the primary organism Mycobacterium abscessus as well as ChIP-seq data extracted from a related proxy organism Mycobacterium tuberculosis. The primary and auxiliary data are combined in a hierarchical Bayesian framework, informing the apposite bivariate likelihood function and prior distributions respectively. The inferred relationships provide insight to regulon groupings in Mycobacterium abscessus. RESULTS: We implement BINDER on data relating to a collection of 167,280 regulator-target pairs resulting in the identification of 54 regulator-target pairs, across 5 transcription factors, for which there is strong probability of regulatory interaction. CONCLUSIONS: The inferred regulatory interactions provide insight to, and a valuable resource for further studies of, transcriptional control in Mycobacterium abscessus, and in the family of Mycobacteriaceae more generally. Further, the developed BINDER framework has broad applicability, useable in settings where computational inference of a gene regulatory network requires integration of data sources derived from both the primary organism of interest and from related proxy organisms.

Asunto(s)

Biología Computacional/métodos , Redes Reguladoras de Genes , Mycobacterium abscessus/genética , Programas Informáticos , Área Bajo la Curva , Bacterias/genética , Simulación por Computador , Regulación Bacteriana de la Expresión Génica , Curva ROC , Regulón/genética

7.

MetSizeR: selecting the optimal sample size for metabolomic studies using an analysis based approach.

Nyamundanda, Gift; Gormley, Isobel Claire; Fan, Yue; Gallagher, William M; Brennan, Lorraine.

BMC Bioinformatics ; 14: 338, 2013 Nov 21.

Artículo en Inglés | MEDLINE | ID: mdl-24261687

RESUMEN

BACKGROUND: Determining sample sizes for metabolomic experiments is important but due to the complexity of these experiments, there are currently no standard methods for sample size estimation in metabolomics. Since pilot studies are rarely done in metabolomics, currently existing sample size estimation approaches which rely on pilot data can not be applied. RESULTS: In this article, an analysis based approach called MetSizeR is developed to estimate sample size for metabolomic experiments even when experimental pilot data are not available. The key motivation for MetSizeR is that it considers the type of analysis the researcher intends to use for data analysis when estimating sample size. MetSizeR uses information about the data analysis technique and prior expert knowledge of the metabolomic experiment to simulate pilot data from a statistical model. Permutation based techniques are then applied to the simulated pilot data to estimate the required sample size. CONCLUSIONS: The MetSizeR methodology, and a publicly available software package which implements the approach, are illustrated through real metabolomic applications. Sample size estimates, informed by the intended statistical analysis technique, and the associated uncertainty are provided.

Asunto(s)

Metabolómica/estadística & datos numéricos , Algoritmos , Animales , Simulación por Computador , Estudios Longitudinales , Modelos Estadísticos , Resonancia Magnética Nuclear Biomolecular/métodos , Proyectos Piloto , Análisis de Componente Principal/normas , Tamaño de la Muestra , Programas Informáticos

8.

MetaboVariation: Exploring Individual Variation in Metabolite Levels.

Gupta, Shubbham; Gormley, Isobel Claire; Brennan, Lorraine.

Metabolites ; 13(2)2023 Jan 23.

Artículo en Inglés | MEDLINE | ID: mdl-36837783

RESUMEN

To date, most metabolomics biomarker research has focused on identifying disease biomarkers. However, there is a need for biomarkers of early metabolic dysfunction to identify individuals who would benefit from lifestyle interventions. Concomitantly, there is a need to develop strategies to analyse metabolomics data at an individual level. We propose "MetaboVariation", a method that models repeated measurements on individuals to explore fluctuations in metabolite levels at an individual level. MetaboVariation employs a Bayesian generalised linear model to flag individuals with intra-individual variations in their metabolite levels across multiple measurements. MetaboVariation models repeated metabolite levels as a function of explanatory variables while accounting for intra-individual variation. The posterior predictive distribution of metabolite levels at the individual level is available, and is used to flag individuals with observed metabolite levels outside the 95% highest posterior density prediction interval at a given time point. MetaboVariation was applied to a dataset containing metabolite levels for 20 metabolites, measured once every four months, in 164 individuals. A total of 28% of individuals with intra-individual variations in three or more metabolites were flagged. An R package for MetaboVariation was developed with an embedded R Shiny web application. To summarize, MetaboVariation has made considerable progress in developing strategies for analysing metabolomics data at the individual level, thus paving the way toward personalised healthcare.

9.

Selecting Milk Spectra to Develop Equations to Predict Milk Technological Traits.

Frizzarin, Maria; Gormley, Isobel Claire; Casa, Alessandro; McParland, Sinéad.

Foods ; 10(12)2021 Dec 11.

Artículo en Inglés | MEDLINE | ID: mdl-34945635

RESUMEN

Including all available data when developing equations to relate midinfrared spectra to a phenotype may be suboptimal for poorly represented spectra. Here, an alternative local changepoint approach was developed to predict six milk technological traits from midinfrared spectra. Neighbours were objectively identified for each predictand as those most similar to the predictand using the Mahalanobis distances between the spectral principal components, and subsequently used in partial least square regression (PLSR) analyses. The performance of the local changepoint approach was compared to that of PLSR using all spectra (global PLSR) and another LOCAL approach, whereby a fixed number of neighbours was used in the prediction according to the correlation between the predictand and the available spectra. Global PLSR had the lowest RMSEV for five traits. The local changepoint approach had the lowest RMSEV for one trait; however, it outperformed the LOCAL approach for four traits. When the 5% of the spectra with the greatest Mahalanobis distance from the centre of the global principal component space were analysed, the local changepoint approach outperformed the global PLSR and the LOCAL approach in two and five traits, respectively. The objective selection of neighbours improved the prediction performance compared to utilising a fixed number of neighbours; however, it generally did not outperform the global PLSR.

10.

The Contribution of Copy Number Variants and Single Nucleotide Polymorphisms to the Additive Genetic Variance of Carcass Traits in Cattle.

Rafter, Pierce; Gormley, Isobel Claire; Parnell, Andrew C; Naderi, Saeid; Berry, Donagh P.

Front Genet ; 12: 761503, 2021.

Artículo en Inglés | MEDLINE | ID: mdl-34795696

RESUMEN

The relative contributions of both copy number variants (CNVs) and single nucleotide polymorphisms (SNPs) to the additive genetic variance of carcass traits in cattle is not well understood. A detailed understanding of the relative importance of CNVs in cattle may have implications for study design of both genomic predictions and genome-wide association studies. The first objective of the present study was to quantify the relative contributions of CNV data and SNP genotype data to the additive genetic variance of carcass weight, fat, and conformation for 945 Charolais, 923 Holstein-Friesian, and 974 Limousin sires. The second objective was to jointly consider SNP and CNV data in a least absolute selection and shrinkage operator (LASSO) regression model to identify genomic regions associated with carcass weight, fat, and conformation within each of the three breeds separately. A genomic relationship matrix (GRM) based on just CNV data did not capture any variance in the three carcass traits when jointly evaluated with a SNP-derived GRM. In the LASSO regression analysis, a total of 987 SNPs and 18 CNVs were associated with at least one of the three carcass traits in at least one of the three breeds. The quantitative trait loci (QTLs) corresponding to the associated SNPs and CNVs overlapped with several candidate genes including previously reported candidate genes such as MSTN and RSAD2, and several potential novel candidate genes such as ACTN2 and THOC1. The results of the LASSO regression analysis demonstrated that CNVs can be used to detect associations with carcass traits which were not detected using the set of SNPs available in the present study. Therefore, the CNVs and SNPs available in the present study were not redundant forms of genomic data.

11.

Probabilistic principal component analysis for metabolomic data.

Nyamundanda, Gift; Brennan, Lorraine; Gormley, Isobel Claire.

BMC Bioinformatics ; 11: 571, 2010 Nov 23.

Artículo en Inglés | MEDLINE | ID: mdl-21092268

RESUMEN

BACKGROUND: Data from metabolomic studies are typically complex and high-dimensional. Principal component analysis (PCA) is currently the most widely used statistical technique for analyzing metabolomic data. However, PCA is limited by the fact that it is not based on a statistical model. RESULTS: Here, probabilistic principal component analysis (PPCA) which addresses some of the limitations of PCA, is reviewed and extended. A novel extension of PPCA, called probabilistic principal component and covariates analysis (PPCCA), is introduced which provides a flexible approach to jointly model metabolomic data and additional covariate information. The use of a mixture of PPCA models for discovering the number of inherent groups in metabolomic data is demonstrated. The jackknife technique is employed to construct confidence intervals for estimated model parameters throughout. The optimal number of principal components is determined through the use of the Bayesian Information Criterion model selection tool, which is modified to address the high dimensionality of the data. CONCLUSIONS: The methods presented are illustrated through an application to metabolomic data sets. Jointly modeling metabolomic data and covariates was successfully achieved and has the potential to provide deeper insight to the underlying data structure. Examination of confidence intervals for the model parameters, such as loadings, allows for principled and clear interpretation of the underlying data structure. A software package called MetabolAnalyze, freely available through the R statistical software, has been developed to facilitate implementation of the presented methods in the metabolomics field.

Asunto(s)

Metabolómica/métodos , Análisis de Componente Principal , Algoritmos , Bases de Datos Factuales

12.

Combining biomarker and self-reported dietary intake data: A review of the state of the art and an exposition of concepts.

Gormley, Isobel Claire; Bai, Yuxin; Brennan, Lorraine.

Stat Methods Med Res ; 29(2): 617-635, 2020 02.

Artículo en Inglés | MEDLINE | ID: mdl-30943855

RESUMEN

Classical approaches to assessing dietary intake are associated with measurement error. In an effort to address inherent measurement error in dietary self-reported data there is increased interest in the use of dietary biomarkers as objective measures of intake. Furthermore, there is a growing consensus of the need to combine dietary biomarker data with self-reported data. A review of state of the art techniques employed when combining biomarker and self-reported data is conducted. Two predominant methods, the calibration method and the method of triads, emerge as relevant techniques used when combining biomarker and self-reported data to account for measurement errors in dietary intake assessment. Both methods crucially assume measurement error independence. To expose and understand the performance of these methods in a range of realistic settings, their underpinning statistical concepts are unified and delineated, and thorough simulation studies are conducted. Results show that violation of the methods' assumptions negatively impacts resulting inference but that this impact is mitigated when the variation of the biomarker around the true intake is small. Thus there is much scope for the further development of biomarkers and models in tandem to achieve the ultimate goal of accurately assessing dietary intake.

Asunto(s)

Biomarcadores , Dieta , Ingestión de Alimentos , Autoinforme , Algoritmos , Calibración , Dieta/estadística & datos numéricos , Femenino , Humanos , Masculino

13.

Combining biomarker and food intake data: calibration equations for citrus intake.

D'Angelo, Silvia; Gormley, Isobel Claire; McNulty, Breige A; Nugent, Anne P; Walton, Janette; Flynn, Albert; Brennan, Lorraine.

Am J Clin Nutr ; 110(4): 977-983, 2019 10 01.

Artículo en Inglés | MEDLINE | ID: mdl-31432078

RESUMEN

BACKGROUND: Measurement error associated with self-reported dietary intake is a well-documented issue. Combining biomarkers of food intake and dietary intake data is a high priority. OBJECTIVES: The aim of this study was to develop calibration equations for food intake, illustrated with an application for citrus intake. Further, a simulation-based framework was developed to determine the portion of biomarker data needed for stable calibration equation estimation in large population studies. METHODS: Calibration equations were developed using mean daily self-reported citrus intake (4-d semiweighed food diaries) and biomarker-derived intake (urinary proline betaine biomarker) data from participants (n = 565) as part of a cross-sectional study. Different functional specifications and biomarker transformations were tested to derive the optimal calibration equation specifications. The simulation study was developed using linear regression for the calibration equations. Stability in the calibration equation estimations was investigated for varying portions of biomarker and intake data "qualities." RESULTS: With citrus intake, linear regression on nontransformed biomarker data resulted in the optimal calibration equation specifications and produced good-quality predicted intakes. The lowest mean squared error (14,354) corresponded to a linear regression model, defined with biomarker-derived estimates of intakes on the original scale. Using this model in a subpopulation without biomarker data resulted in an average mean ± SD citrus intake of 81 ± 66 g/d. The simulation study suggested that in large population studies, biomarker data on 20-30% of the subjects are required to guarantee stable estimation of calibration equations. This article is accompanied by a web application ("Bio-Intake"), which was developed to facilitate measurement error correction in self-reported mean daily citrus intake data. CONCLUSIONS: Calibration equations proved to be a useful instrument to correct measurement error in self-reported food intake data. The simulation study demonstrated that the use of food intake biomarkers may be feasible and beneficial in the context of large population studies.

Asunto(s)

Citrus , Dieta , Conducta Alimentaria , Proyectos de Investigación , Biomarcadores , Humanos

14.

CLUSTERING SOUTH AFRICAN HOUSEHOLDS BASED ON THEIR ASSET STATUS USING LATENT VARIABLE MODELS.

McParland, Damien; Gormley, Isobel Claire; McCormick, Tyler H; Clark, Samuel J; Kabudula, Chodziwadziwa Whiteson; Collinson, Mark A.

Ann Appl Stat ; 8(2): 747-776, 2014 Jun 01.

Artículo en Inglés | MEDLINE | ID: mdl-25485026

RESUMEN

The Agincourt Health and Demographic Surveillance System has since 2001 conducted a biannual household asset survey in order to quantify household socio-economic status (SES) in a rural population living in northeast South Africa. The survey contains binary, ordinal and nominal items. In the absence of income or expenditure data, the SES landscape in the study population is explored and described by clustering the households into homogeneous groups based on their asset status. A model-based approach to clustering the Agincourt households, based on latent variable models, is proposed. In the case of modeling binary or ordinal items, item response theory models are employed. For nominal survey items, a factor analysis model, similar in nature to a multinomial probit model, is used. Both model types have an underlying latent variable structure-this similarity is exploited and the models are combined to produce a hybrid model capable of handling mixed data types. Further, a mixture of the hybrid models is considered to provide clustering capabilities within the context of mixed binary, ordinal and nominal response data. The proposed model is termed a mixture of factor analyzers for mixed data (MFA-MD). The MFA-MD model is applied to the survey data to cluster the Agincourt households into homogeneous groups. The model is estimated within the Bayesian paradigm, using a Markov chain Monte Carlo algorithm. Intuitive groupings result, providing insight to the different socio-economic strata within the Agincourt region.

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA