Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
1.
BMC Genomics ; 13: 340, 2012 Jul 24.
Artículo en Inglés | MEDLINE | ID: mdl-22827772

RESUMEN

BACKGROUND: Pre-symptomatic prediction of disease and drug response based on genetic testing is a critical component of personalized medicine. Previous work has demonstrated that the predictive capacity of genetic testing is constrained by the heritability and prevalence of the tested trait, although these constraints have only been approximated under the assumption of a normally distributed genetic risk distribution. RESULTS: Here, we mathematically derive the absolute limits that these factors impose on test accuracy in the absence of any distributional assumptions on risk. We present these limits in terms of the best-case receiver-operating characteristic (ROC) curve, consisting of the best-case test sensitivities and specificities, and the AUC (area under the curve) measure of accuracy. We apply our method to genetic prediction of type 2 diabetes and breast cancer, and we additionally show the best possible accuracy that can be obtained from integrated predictors, which can incorporate non-genetic features. CONCLUSION: Knowledge of such limits is valuable in understanding the implications of genetic testing even before additional associations are identified.


Asunto(s)
Neoplasias de la Mama/genética , Diabetes Mellitus Tipo 2/genética , Genoma Humano , Modelos Genéticos , Área Bajo la Curva , Neoplasias de la Mama/diagnóstico , Simulación por Computador , Diabetes Mellitus Tipo 2/diagnóstico , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Medicina de Precisión , Valor Predictivo de las Pruebas , Pronóstico , Curva ROC
2.
J Biol Chem ; 286(40): 34497-503, 2011 Oct 07.
Artículo en Inglés | MEDLINE | ID: mdl-21768115

RESUMEN

Smooth muscle cells (SMCs) have a pivotal role in cardiovascular diseases and are responsible for hyaluronan (HA) deposition in thickening vessel walls. HA regulates SMC proliferation, migration, and inflammation, which accelerates neointima formation. We used the HA synthesis inhibitor 4-methylumbelliferone (4-MU) to reduce HA production in human aortic SMCs and found a significant increase of apoptotic cells. Interestingly, the exogenous addition of HA together with 4-MU reduced apoptosis. A similar anti-apoptotic effect was observed also by adding other glycosaminoglycans and glucose to 4-MU-treated cells. Furthermore, the anti-apoptotic effect of HA was mediated by Toll-like receptor 4, CD44, and PI3K but not by ERK1/2.


Asunto(s)
Aorta/patología , Apoptosis , Glucosa/metabolismo , Glicosaminoglicanos/metabolismo , Ácido Hialurónico/farmacología , Himecromona/análogos & derivados , Miocitos del Músculo Liso/citología , Movimiento Celular , Proliferación Celular , Glicoproteínas/metabolismo , Humanos , Receptores de Hialuranos/biosíntesis , Himecromona/farmacología , Inflamación , Proteína Quinasa 1 Activada por Mitógenos/metabolismo , Proteína Quinasa 3 Activada por Mitógenos/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos , Fosfatidilinositol 3-Quinasas/metabolismo , Receptor Toll-Like 4/metabolismo
3.
J Biomed Inform ; 44(4): 565-75, 2011 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-21310265

RESUMEN

Feedback control is an important regulatory process in biological systems, which confers robustness against external and internal disturbances. Genes involved in feedback structures are therefore likely to have a major role in regulating cellular processes. Here we rely on a dynamic Bayesian network approach to identify feedback loops in cell cycle regulation. We analyzed the transcriptional profile of the cell cycle in HeLa cancer cells and identified a feedback loop structure composed of 10 genes. In silico analyses showed that these genes hold important roles in system's dynamics. The results of published experimental assays confirmed the central role of 8 of the identified feedback loop genes in cell cycle regulation. In conclusion, we provide a novel approach to identify critical genes for the dynamics of biological processes. This may lead to the identification of therapeutic targets in diseases that involve perturbations of these dynamics.


Asunto(s)
Ciclo Celular/genética , Biología Computacional/métodos , Retroalimentación Fisiológica/fisiología , Expresión Génica , Redes Reguladoras de Genes , Teorema de Bayes , Simulación por Computador , Bases de Datos Genéticas , Células HeLa , Humanos , Modelos Biológicos
4.
Cancer ; 117(2): 353-60, 2011 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-20839314

RESUMEN

BACKGROUND: Transcriptional networks play a central role in cancer development. The authors described a systems biology approach to cancer classification based on the reverse engineering of the transcriptional network surrounding the 2 most common types of lung cancer: adenocarcinoma (AC) and squamous cell carcinoma (SCC). METHODS: A transcriptional network classifier was inferred from the molecular profiles of 111 human lung carcinomas. The authors tested its classification accuracy in 7 independent cohorts, for a total of 422 subjects of Caucasian, African, and Asian descent. RESULTS: The model for distinguishing AC from SCC was a 25-gene network signature. Its performance on the 7 independent cohorts achieved 95.2% classification accuracy. Even more surprisingly, 95% of this accuracy was explained by the interplay of 3 genes (KRT6A, KRT6B, KRT6C) on a narrow cytoband of chromosome 12. The role of this chromosomal region in distinguishing AC and SCC was further confirmed by the analysis of another group of 28 independent subjects assayed by DNA copy number changes. The copy number variations of bands 12q12, 12q13, and 12q12-13 discriminated these samples with 84% accuracy. CONCLUSIONS: These results suggest the existence of a robust signature localized in a relatively small area of the genome, and show the clinical potential of reverse engineering transcriptional networks from molecular profiles.


Asunto(s)
Adenocarcinoma/genética , Carcinoma de Células Escamosas/genética , Redes Reguladoras de Genes , Neoplasias Pulmonares/clasificación , Neoplasias Pulmonares/genética , Teorema de Bayes , Cromosomas Humanos Par 12 , Humanos , Biología de Sistemas/métodos
5.
Artículo en Inglés | MEDLINE | ID: mdl-22255111

RESUMEN

Gene expression and genome wide association data have provided researchers the opportunity to study many complex traits and diseases. When designing prognostic and predictive models capable of phenotypic classification in this area, significant reduction of dimensionality through stringent filtering and/or feature selection is often deemed imperative. Here, this work challenges this presumption through both theoretical and empirical analysis. This work demonstrates that by a proper compromise between structure of the selected model and the number of features, one is able to achieve better performance even in large dimensionality. The inclusion of many genes/variants in the classification rules can help shed new light on the analysis of complex traitstraits that are typically determined by many causal variants with small effect size.


Asunto(s)
Conducta , Enfermedad , Modelos Teóricos , Humanos
6.
BMC Bioinformatics ; 11 Suppl 9: S2, 2010 Oct 28.
Artículo en Inglés | MEDLINE | ID: mdl-21044360

RESUMEN

BACKGROUND: Identification of expression quantitative trait loci (eQTLs) is an emerging area in genomic study. The task requires an integrated analysis of genome-wide single nucleotide polymorphism (SNP) data and gene expression data, raising a new computational challenge due to the tremendous size of data. RESULTS: We develop a method to identify eQTLs. The method represents eQTLs as information flux between genetic variants and transcripts. We use information theory to simultaneously interrogate SNP and gene expression data, resulting in a Transcriptional Information Map (TIM) which captures the network of transcriptional information that links genetic variations, gene expression and regulatory mechanisms. These maps are able to identify both cis- and trans- regulating eQTLs. The application on a dataset of leukemia patients identifies eQTLs in the regions of the GART, PCP4, DSCAM, and RIPK4 genes that regulate ADAMTS1, a known leukemia correlate. CONCLUSIONS: The information theory approach presented in this paper is able to infer the dependence networks between SNPs and transcripts, which in turn can identify cis- and trans-eQTLs. The application of our method to the leukemia study explains how genetic variants and gene expression are linked to leukemia.


Asunto(s)
Biología Computacional/métodos , Genoma , Transcripción Genética/genética , Expresión Génica , Perfilación de la Expresión Génica , Variación Genética , Humanos , Leucemia/genética , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo
7.
Ann Hum Genet ; 74(6): 489-97, 2010 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-20846217

RESUMEN

The etiology of growth impairment in Crohn's disease (CD) has been inadequately explained by nutritional, hormonal, and/or disease-related factors, suggesting that genetics may be an additional contributor. The aim of this cross-sectional study was to investigate genetic variants associated with linear growth in pediatric-onset CD. We genotyped 951 subjects (317 CD patient-parent trios) for 64 polymorphisms within 14 CD-susceptibility and 23 stature-associated loci. Patient height-for-age Z-score < -1.64 was used to dichotomize probands into growth-impaired and nongrowth-impaired groups. The transmission disequilibrium test (TDT) was used to study association to growth impairment. There was a significant association between growth impairment in CD (height-for-age Z-score < -1.64) and a stature-related polymorphism in the dymeclin gene DYM (rs8099594) (OR = 3.2, CI [1.57-6.51], p = 0.0007). In addition, there was nominal over-transmission of two CD-susceptibility alleles, 10q21.1 intergenic region (rs10761659) and ATG16L1 (rs10210302), in growth-impaired CD children (OR = 2.36, CI [1.26-4.41] p = 0.0056 and OR = 2.45, CI [1.22-4.95] p = 0.0094, respectively). Our data indicate that genetic influences due to stature-associated and possibly CD risk alleles may predispose CD patients to alterations in linear growth. This is the first report of a link between a stature-associated locus and growth impairment in CD.


Asunto(s)
Estatura/genética , Trastornos del Crecimiento/etiología , Adolescente , Niño , Preescolar , Enfermedad de Crohn/genética , Estudios Transversales , Femenino , Predisposición Genética a la Enfermedad , Genotipo , Trastornos del Crecimiento/genética , Humanos , Lactante , Péptidos y Proteínas de Señalización Intracelular , Masculino , Proyectos Piloto , Proteínas/metabolismo , Población Blanca
8.
Mol Med ; 16(11-12): 513-26, 2010.
Artículo en Inglés | MEDLINE | ID: mdl-20811658

RESUMEN

Abilities to successfully quit smoking display substantial evidence for heritability in classic and molecular genetic studies. Genome-wide association (GWA) studies have demonstrated single-nucleotide polymorphisms (SNPs) and haplotypes that distinguish successful quitters from individuals who were unable to quit smoking in clinical trial participants and in community samples. Many of the subjects in these clinical trial samples were aided by nicotine replacement therapy (NRT). We now report novel GWA results from participants in a clinical trial that sought dose/response relationships for "precessation" NRT. In this trial, 369 European-American smokers were randomized to 21 or 42 mg NRT, initiated 2 wks before target quit dates. Ten-week continuous smoking abstinence was assessed on the basis of self-reports and carbon monoxide levels. SNP genotyping used Affymetrix 6.0 arrays. GWA results for smoking cessation success provided no P value that reached "genome-wide" significance. Compared with chance, these results do identify (a) more clustering of nominally positive results within small genomic regions, (b) more overlap between these genomic regions and those identified in six prior successful smoking cessation GWA studies and (c) sets of genes that fall into gene ontology categories that appear to be biologically relevant. The 1,000 SNPs with the strongest associations form a plausible Bayesian network; no such network is formed by randomly selected sets of SNPs. The data provide independent support, based on individual genotyping, for many loci previously nominated on the basis of data from genotyping in pooled DNA samples. These results provide further support for the idea that aid for smoking cessation may be personalized on the basis of genetic predictors of outcome.


Asunto(s)
Estudio de Asociación del Genoma Completo , Nicotina/uso terapéutico , Cese del Hábito de Fumar/métodos , Fumar/genética , Fumar/terapia , Adulto , Teorema de Bayes , Monóxido de Carbono/análisis , Pruebas Genéticas , Genotipo , Humanos , Polimorfismo de Nucleótido Simple , Tabaquismo/genética , Resultado del Tratamiento
9.
J Bacteriol ; 192(17): 4300-10, 2010 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-20601467

RESUMEN

Vibrio cholerae is a Gram-negative bacillus that is the causative agent of cholera. Pathogenesis in vivo occurs through a series of spatiotemporally controlled events under the control of a gene cascade termed the ToxR regulon. Major genes in the ToxR regulon include the master regulators toxRS and tcpPH, the downstream regulator toxT, and virulence factors, the ctxAB and tcpA operons. Our current understanding of the dynamics of virulence gene expression is limited to microarray analyses of expression at selected time points. To better understand this process, we utilized a systems biology approach to examine the temporal regulation of gene expression in El Tor V. cholerae grown under virulence-inducing conditions in vitro (AKI medium), using high-resolution time series genomic profiling. Results showed that overall gene expression in AKI medium mimics that of in vivo studies but with less clear temporal separation between upstream regulators and downstream targets. Expression of toxRS was unaffected by growth under virulence-inducing conditions, but expression of toxT was activated shortly after switching from stationary to aerating conditions. The tcpA operon was also activated early during mid-exponential-phase growth, while the ctxAB operon was turned on later, after the rise in toxT expression. Expression of ctxAB continued to rise despite an eventual decrease in toxT. Cluster analysis of gene expression highlighted 15 hypothetical genes and six genes related to environmental information processing that represent potential new members of the ToxR regulon. This study applies systems biology tools to analysis of gene expression of V. cholerae in vitro and provides an important comparator for future studies done in vivo.


Asunto(s)
Proteínas Bacterianas/metabolismo , Perfilación de la Expresión Génica , Regulación Bacteriana de la Expresión Génica , Biología de Sistemas/métodos , Vibrio cholerae/crecimiento & desarrollo , Vibrio cholerae/patogenicidad , Proteínas Bacterianas/genética , Técnicas Bacteriológicas , Toxina del Cólera/genética , Toxina del Cólera/metabolismo , Análisis por Conglomerados , Medios de Cultivo , Análisis de Secuencia por Matrices de Oligonucleótidos , Factores de Tiempo , Vibrio cholerae/genética , Vibrio cholerae/metabolismo , Virulencia
11.
Brief Bioinform ; 11(1): 80-95, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-19906839

RESUMEN

The field of synthetic biology holds an inspiring vision for the future; it integrates computational analysis, biological data and the systems engineering paradigm in the design of new biological machines and systems. These biological machines are built from basic biomolecular components analogous to electrical devices, and the information flow among these components requires the augmentation of biological insight with the power of a formal approach to information management. Here we review the informatics challenges in synthetic biology along three dimensions: in silico, in vitro and in vivo. First, we describe state of the art of the in silico support of synthetic biology, from the specific data exchange formats, to the most popular software platforms and algorithms. Next, we cast in vitro synthetic biology in terms of information flow, and discuss genetic fidelity in DNA manipulation, development strategies of biological parts and the regulation of biomolecular networks. Finally, we explore how the engineering chassis can manipulate biological circuitries in vivo to give rise to future artificial organisms.


Asunto(s)
Biología de Sistemas , Algoritmos , Secuencia de Bases , ADN , Datos de Secuencia Molecular , Homología de Secuencia de Ácido Nucleico , Programas Informáticos
12.
Circulation ; 120(24): 2448-54, 2009 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-19948975

RESUMEN

BACKGROUND: Many different genetic and clinical factors have been identified as causes or contributors to atherosclerosis. We present a model of preclinical atherosclerosis based on genetic and clinical data that predicts the presence of coronary artery calcification in healthy Americans of European descent 45 to 84 years of age in the Multi-Ethnic Study of Atherosclerosis (MESA). METHODS AND RESULTS: We assessed 712 individuals for the presence or absence of coronary artery calcification and assessed their genotypes for 2882 single-nucleotide polymorphisms. With the use of these single-nucleotide polymorphisms and relevant clinical data, a Bayesian network that predicts the presence of coronary calcification was constructed. The model contained 13 single-nucleotide polymorphisms (from genes AGTR1, ALOX15, INSR, PRKAB1, IL1R2, ESR2, KCNK1, FBLN5, PPARA, VEGFA, PON1, TDRD6, PLA2G7, and 1 ancestry informative marker) and 5 clinical variables (sex, age, weight, smoking, and diabetes mellitus) and achieved 85% predictive accuracy, as measured by area under the receiver operating characteristic curve. This is a significant (P<0.001) improvement on models that use just the single-nucleotide polymorphism data or just the clinical variables. CONCLUSIONS: We present an investigation of joint genetic and clinical factors associated with atherosclerosis that shows predictive results for both cases, as well as enhanced performance for their combination.


Asunto(s)
Aterosclerosis/genética , Calcinosis/genética , Enfermedad de la Arteria Coronaria/genética , Modelos Cardiovasculares , Anciano , Anciano de 80 o más Años , Aterosclerosis/etnología , Teorema de Bayes , Calcinosis/etnología , Estudios de Cohortes , Enfermedad de la Arteria Coronaria/etnología , Femenino , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Valor Predictivo de las Pruebas , Factores de Riesgo , Población Blanca/etnología , Población Blanca/genética
13.
Cancer Res ; 69(23): 9029-37, 2009 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-19903842

RESUMEN

Like all primary cells in vitro, normal human melanocytes exhibit a physiologic decay in proliferative potential as it transitions to a growth-arrested state. The underlying transcriptional program(s) that regulate this phenotypic change is largely unknown. To identify molecular determinants of this process, we performed a Bayesian-based dynamic gene expression analysis on primary melanocytes undergoing proliferative arrest. This analysis revealed several related clusters whose expression behavior correlated with the melanocyte growth kinetics; we designated these clusters the melanocyte growth arrest program (MGAP). These MGAP genes were preferentially represented in benign melanocytic nevi over melanomas and selectively mapped to the hepatocyte fibrosis pathway. This transcriptional relationship between melanocyte growth stasis, nevus biology, and fibrogenic signaling was further validated in vivo by the demonstration of strong pericellular collagen deposition within benign nevi but not melanomas. Taken together, our study provides a novel view of fibroplasia in both melanocyte biology and nevogenesis.


Asunto(s)
Transformación Celular Neoplásica/genética , Melanocitos/fisiología , Melanoma/genética , Nevo/genética , Teorema de Bayes , Procesos de Crecimiento Celular/genética , Transformación Celular Neoplásica/patología , Análisis por Conglomerados , Perfilación de la Expresión Génica/métodos , Humanos , Melanocitos/citología , Melanoma/patología , Nevo/patología
14.
Pharmacogenomics ; 10(9): 1393-412, 2009 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-19761364

RESUMEN

AIMS: Bronchodilator response tests measure the effect of beta(2)-agonists, the most commonly used short-acting reliever drugs for asthma. We sought to relate candidate gene SNP data with bronchodilator response and measure the predictive accuracy of a model constructed with genetic variants. MATERIALS & METHODS: Bayesian networks, multivariate models that are able to account for simultaneous associations and interactions among variables, were used to create a predictive model of bronchodilator response using candidate gene SNP data from 308 Childhood Asthma Management Program Caucasian subjects. RESULTS: The model found that 15 SNPs in 15 genes predict bronchodilator response with fair accuracy, as established by a fivefold cross-validation area under the receiver-operating characteristic curve of 0.75 (standard error: 0.03). CONCLUSION: Bayesian networks are an attractive approach to analyze large-scale pharmacogenetic SNP data because of their ability to automatically learn complex models that can be used for the prediction and discovery of novel biological hypotheses.


Asunto(s)
Asma/tratamiento farmacológico , Asma/genética , Broncodilatadores/uso terapéutico , Asma/fisiopatología , Teorema de Bayes , Niño , Interpretación Estadística de Datos , Femenino , Variación Genética , Genotipo , Humanos , Modelos Logísticos , Masculino , Redes Neurales de la Computación , Farmacogenética , Polimorfismo de Nucleótido Simple , Valor Predictivo de las Pruebas , Reproducibilidad de los Resultados , Pruebas de Función Respiratoria
15.
BMC Bioinformatics ; 10 Suppl 9: S1, 2009 Sep 17.
Artículo en Inglés | MEDLINE | ID: mdl-19761563

RESUMEN

BACKGROUND: Gene interactions play a central role in transcriptional networks. Many studies have performed genome-wide expression analysis to reconstruct regulatory networks to investigate disease processes. Since biological processes are outcomes of regulatory gene interactions, this paper develops a system biology approach to infer function-dependent transcriptional networks modulating phenotypic traits, which serve as a classifier to identify tissue states. Due to gene interactions taken into account in the analysis, we can achieve higher classification accuracy than existing methods. RESULTS: Our system biology approach is carried out by the Bayesian networks framework. The algorithm consists of two steps: gene filtering by Bayes factor followed by collinearity elimination via network learning. We validate our approach with two clinical data. In the study of lung cancer subtypes discrimination, we obtain a 25-gene classifier from 111 training samples, and the test on 422 independent samples achieves 95% classification accuracy. In the study of thoracic aortic aneurysm (TAA) diagnosis, 61 samples determine a 34-gene classifier, whose diagnosis accuracy on 33 independent samples achieves 82%. The performance comparisons with three other popular methods, PCA/LDA, PAM, and Weighted Voting, confirm that our approach yields superior classification accuracy and a more compact signature. CONCLUSIONS: The system biology approach presented in this paper is able to infer function-dependent transcriptional networks, which in turn can classify biological samples with high accuracy. The validation of our classifier using clinical data demonstrates the promising value of our proposed approach for disease diagnosis.


Asunto(s)
Redes Reguladoras de Genes , Transcripción Genética , Aneurisma de la Aorta Torácica/diagnóstico , Aneurisma de la Aorta Torácica/genética , Teorema de Bayes , Humanos , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/genética , Biología de Sistemas
16.
J Am Med Inform Assoc ; 16(3): 371-9, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-19261943

RESUMEN

OBJECTIVE: Identify clinical factors that modulate the risk of progression to COPD among asthma patients using data extracted from electronic medical records. DESIGN: Demographic information and comorbidities from adult asthma patients who were observed for at least 5 years with initial observation dates between 1988 and 1998, were extracted from electronic medical records of the Partners Healthcare System using tools of the National Center for Biomedical Computing "Informatics for Integrating Biology to the Bedside" (i2b2). MEASUREMENTS: A predictive model of COPD was constructed from a set of 9,349 patients (843 cases, 8,506 controls) using Bayesian networks. The model's predictive accuracy was tested using it to predict COPD in a future independent set of asthma patients (992 patients; 46 cases, 946 controls), who had initial observation dates between 1999 and 2002. RESULTS: A Bayesian network model composed of age, sex, race, smoking history, and 8 comorbidity variables is able to predict COPD in the independent set of patients with an accuracy of 83.3%, computed as the area under the Receiver Operating Characteristic curve (AUROC). CONCLUSIONS: Our results demonstrate that data extracted from electronic medical records can be used to create predictive models. With improvements in data extraction and inclusion of more variables, such models may prove to be clinically useful.


Asunto(s)
Asma/complicaciones , Teorema de Bayes , Sistemas de Registros Médicos Computarizados , Redes Neurales de la Computación , Enfermedad Pulmonar Obstructiva Crónica/etiología , Anciano , Comorbilidad , Progresión de la Enfermedad , Femenino , Humanos , Masculino , Persona de Mediana Edad , Modelos Biológicos , Análisis Multivariante , Procesamiento de Lenguaje Natural , Curva ROC , Factores de Riesgo
17.
J Neurogenet ; 23(3): 283-92, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-19184766

RESUMEN

Individuals' dependence on nicotine, primarily through cigarette smoking, is a major source of morbidity and mortality worldwide. Many smokers attempt but fail to quit smoking, motivating researchers to identify the origins of this dependence. Because of the known heritability of nicotine-dependence phenotypes, considerable interest has been focused on discovering the genetic factors underpinning the trait. This goal, however, is not easily attained: no single factor is likely to explain any great proportion of dependence because nicotine dependence is thought to be a complex trait (i.e., the result of many interacting factors). Genomewide association studies are powerful tools in the search for the genomic bases of complex traits, and in this context, novel candidate genes have been identified through single nucleotide polymorphism (SNP) association analyses. Beyond association, however, genetic data can be used to generate predictive models of nicotine dependence. As expected in the context of a complex trait, individual SNPs fail to accurately predict nicotine dependence, demanding the use of multivariate models. Standard approaches, such as logistic regression, are unable to consider large numbers of SNPs given existing sample sizes. However, using Bayesian networks, one can overcome these limitations to generate a multivariate predictive model, which has markedly enhanced predictive accuracy on fitted values relative to that of individual SNPs. This approach, combined with the data being generated by genomewide association studies, promises to shed new light on the common, complex trait nicotine dependence.


Asunto(s)
Modelos Biológicos , Tabaquismo/diagnóstico , Animales , Teorema de Bayes , Estudios de Asociación Genética , Humanos , Pronóstico , Tabaquismo/genética , Tabaquismo/fisiopatología
18.
J Biomed Inform ; 42(2): 287-95, 2009 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-18790084

RESUMEN

Though genome-wide technologies, such as microarrays, are widely used, data from these methods are considered noisy; there is still varied success in downstream biological validation. We report a method that increases the likelihood of successfully validating microarray findings using real time RT-PCR, including genes at low expression levels and with small differences. We use a Bayesian network to identify the most relevant sources of noise based on the successes and failures in validation for an initial set of selected genes, and then improve our subsequent selection of genes for validation based on eliminating these sources of noise. The network displays the significant sources of noise in an experiment, and scores the likelihood of validation for every gene. We show how the method can significantly increase validation success rates. In conclusion, in this study, we have successfully added a new automated step to determine the contributory sources of noise that determine successful or unsuccessful downstream biological validation.


Asunto(s)
Teorema de Bayes , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Análisis de Varianza , Animales , Retinopatía Diabética/genética , Retinopatía Diabética/metabolismo , Genómica , Hiperoxia/genética , Hiperoxia/metabolismo , Ratones , Modelos Estadísticos , ARN Mensajero/análisis , Reproducibilidad de los Resultados
19.
Stroke ; 40(3 Suppl): S67-70, 2009 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-19064790

RESUMEN

Cardioembolic stroke is a complex disease resulting from the interaction of numerous factors. Using data from Genes Affecting Stroke Risk and Outcome Study (GASROS), we show that a multivariate predictive model built using Bayesian networks is able to achieve a predictive accuracy of 86% on the fitted values as computed by the area under the receiver operating characteristic curve relative to that of the individual single nucleotide polymorphism with the highest prognostic performance (area under the receiver operating characteristic curve=60%).


Asunto(s)
Genómica , Modelos Genéticos , Accidente Cerebrovascular/genética , Teorema de Bayes , Humanos , Análisis Multivariante , Polimorfismo de Nucleótido Simple/genética , Pronóstico , Factores de Riesgo , Accidente Cerebrovascular/diagnóstico , Accidente Cerebrovascular/epidemiología , Población Blanca/genética
20.
AMIA Annu Symp Proc ; : 308-12, 2008 Nov 06.
Artículo en Inglés | MEDLINE | ID: mdl-18999057

RESUMEN

The increasing availability of electronic medical records offers opportunities to better characterize patient populations and create predictive tools to individualize health care. We determined which asthma patients suffer exacerbations using data extracted from electronic medical records of the Partners Healthcare System using Natural Language Processing tools from the "Informatics for Integrating Biology to the Bedside" center (i2b2). Univariable and multivariable analysis of data for 11,356 patients (1,394 cases, 9,962 controls) found that race, BMI, smoking history, and age at initial observation are predictors of asthma exacerbations. The area under the receiver operating characteristic curve (AUROC) corresponding to prediction of exacerbations in an independent group of 1,436 asthma patients (106 cases, 1,330 controls) is 0.67. Our findings are consistent with previous characterizations of asthma patients in epidemiological studies, and demonstrate that data extracted by natural language processing from electronic medical records is suitable for the characterization of patient populations.


Asunto(s)
Inteligencia Artificial , Asma/diagnóstico , Asma/epidemiología , Sistemas de Registros Médicos Computarizados/estadística & datos numéricos , Procesamiento de Lenguaje Natural , Reconocimiento de Normas Patrones Automatizadas/métodos , Boston/epidemiología , Enfermedad Crónica , Humanos , Incidencia , Medición de Riesgo/métodos , Factores de Riesgo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA