Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 44
Filtrar
1.
Endocrinol Metab (Seoul) ; 38(4): 406-417, 2023 08.
Artículo en Inglés | MEDLINE | ID: mdl-37533176

RESUMEN

BACKGRUOUND: While the triglyceride-glucose (TyG) index is a measure of insulin resistance, its association with cardiovascular disease (CVD) has not been well elucidated. We evaluated the TyG index for prediction of CVDs in a prospective large communitybased cohort. METHODS: Individuals 40 to 70 years old were prospectively followed for a median 15.6 years. The TyG index was calculated as the Ln [fasting triglycerides (mg/dL)×fasting glucose (mg/dL)/2]. CVDs included any acute myocardial infarction, coronary artery disease or cerebrovascular disease. We used a Cox proportional hazards model to estimate CVD risks according to quartiles of the TyG index and plotted the receiver operating characteristics curve for the incident CVD. RESULTS: Among 8,511 subjects (age 51.9±8.8 years; 47.5% males), 931 (10.9%) had incident CVDs during the follow-up. After adjustment for age, sex, body mass index, diabetes mellitus, hypertension, total cholesterol, smoking, alcohol, exercise, and C-reactive protein, subjects in the highest TyG quartile had 36% increased risk of incident CVD compared with the lowest TyG quartile (hazard ratio, 1.36; 95% confidence interval, 1.10 to 1.68). Carotid plaque, assessed by ultrasonography was more frequent in subjects in the higher quartile of TyG index (P for trend=0.049 in men and P for trend <0.001 in women). The TyG index had a higher predictive power for CVDs than the homeostasis model assessment of insulin resistance (HOMA-IR) (area under the curve, 0.578 for TyG and 0.543 for HOMA-IR). Adding TyG index on diabetes or hypertension alone gave sounder predictability for CVDs. CONCLUSION: The TyG index is independently associated with future CVDs in 16 years of follow-up in large, prospective Korean cohort.


Asunto(s)
Aterosclerosis , Enfermedades Cardiovasculares , Diabetes Mellitus , Hipertensión , Resistencia a la Insulina , Masculino , Humanos , Femenino , Adulto , Persona de Mediana Edad , Anciano , Glucosa , Enfermedades Cardiovasculares/epidemiología , Estudios de Cohortes , Estudios de Seguimiento , Estudios Prospectivos , Triglicéridos , Vida Independiente , Glucemia/metabolismo , Diabetes Mellitus/epidemiología , Aterosclerosis/diagnóstico , Aterosclerosis/epidemiología
2.
Dev Psychopathol ; : 1-11, 2022 Dec 16.
Artículo en Inglés | MEDLINE | ID: mdl-36524242

RESUMEN

Parents share half of their genes with their children, but they also share background social factors and actively help shape their child's environment - making it difficult to disentangle genetic and environmental causes of parent-offspring similarity. While adoption and extended twin family designs have been extremely useful for distinguishing genetic and nongenetic parental influences, these designs entail stringent assumptions about phenotypic similarity between relatives and require samples that are difficult to collect and therefore are typically small and not publicly shared. Here, we describe these traditional designs, as well as modern approaches that use large, publicly available genome-wide data sets to estimate parental effects. We focus in particular on an approach we recently developed, structural equation modeling (SEM)-polygenic score (PGS), that instantiates the logic of modern PGS-based methods within the flexible SEM framework used in traditional designs. Genetically informative designs such as SEM-PGS rely on different and, in some cases, less rigid assumptions than traditional approaches; thus, they allow researchers to capitalize on new data sources and answer questions that could not previously be investigated. We believe that SEM-PGS and similar approaches can lead to improved insight into how nature and nurture combine to create the incredible diversity underlying human behavior.

3.
Nat Genet ; 54(5): 581-592, 2022 05.
Artículo en Inglés | MEDLINE | ID: mdl-35534559

RESUMEN

Estimates from genome-wide association studies (GWAS) of unrelated individuals capture effects of inherited variation (direct effects), demography (population stratification, assortative mating) and relatives (indirect genetic effects). Family-based GWAS designs can control for demographic and indirect genetic effects, but large-scale family datasets have been lacking. We combined data from 178,086 siblings from 19 cohorts to generate population (between-family) and within-sibship (within-family) GWAS estimates for 25 phenotypes. Within-sibship GWAS estimates were smaller than population estimates for height, educational attainment, age at first birth, number of children, cognitive ability, depressive symptoms and smoking. Some differences were observed in downstream SNP heritability, genetic correlations and Mendelian randomization analyses. For example, the within-sibship genetic correlation between educational attainment and body mass index attenuated towards zero. In contrast, analyses of most molecular phenotypes (for example, low-density lipoprotein-cholesterol) were generally consistent. We also found within-sibship evidence of polygenic adaptation on taller height. Here, we illustrate the importance of family-based GWAS data for phenotypes influenced by demographic and indirect genetic effects.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Humanos , Análisis de la Aleatorización Mendeliana , Herencia Multifactorial/genética , Fenotipo , Polimorfismo de Nucleótido Simple/genética
4.
Front Genet ; 12: 634922, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34267778

RESUMEN

In the "personalized medicine" era, one of the most difficult problems is identification of combined markers from different omics platforms. Many methods have been developed to identify candidate markers for each type of omics data, but few methods facilitate the identification of multiple markers on multi-omics platforms. microRNAs (miRNAs) is well known to affect only indirectly phenotypes by regulating mRNA expression and/or protein translation. To take into account this knowledge into practice, we suggest a miRNA-mRNA integration model for survival time analysis, called mimi-surv, which accounts for the biological relationship, to identify such integrated markers more efficiently. Through simulation studies, we found that the statistical power of mimi-surv be better than other models. Application to real datasets from Seoul National University Hospital and The Cancer Genome Atlas demonstrated that mimi-surv successfully identified miRNA-mRNA integrations sets associated with progression-free survival of pancreatic ductal adenocarcinoma (PDAC) patients. Only mimi-surv found miR-96, a previously unidentified PDAC-related miRNA in these two real datasets. Furthermore, mimi-surv was shown to identify more PDAC related miRNAs than other methods because it used the known structure for miRNA-mRNA regularization. An implementation of mimi-surv is available at http://statgen.snu.ac.kr/software/mimi-surv.

5.
Psychiatry Investig ; 18(5): 453-462, 2021 May.
Artículo en Inglés | MEDLINE | ID: mdl-33993688

RESUMEN

OBJECTIVE: Bipolar disorder (BD) is complex genetic disorder. Therefore, approaches using clinical phenotypes such as biological rhythm disruption could be an alternative. In this study, we explored the relationship between melatonin pathway genes with circadian and seasonal rhythms of BD. METHODS: We recruited clinically stable patients with BD (n=324). We measured the seasonal variation of mood and behavior (seasonality), and circadian preference, on a lifetime basis. We analyzed 34 variants in four genes (MTNR1a, MTNR1b, AANAT, ASMT) involved in the melatonin pathway. RESULTS: Four variants were nominally associated with seasonality and circadian preference. After multiple test corrections, the rs116879618 in AANAT remained significantly associated with seasonality (corrected p=0.0151). When analyzing additional variants of AANAT through imputation, the rs117849139, rs77121614 and rs28936679 (corrected p=0.0086, 0.0154, and 0.0092) also showed a significant association with seasonality. CONCLUSION: This is the first study reporting the relationship between variants of AANAT and seasonality in patients with BD. Since AANAT controls the level of melatonin production in accordance with light and darkness, this study suggests that melatonin may be involved in the pathogenesis of BD, which frequently shows a seasonality of behaviors and symptom manifestations.

7.
Sci Total Environ ; 772: 145386, 2021 Jun 10.
Artículo en Inglés | MEDLINE | ID: mdl-33770858

RESUMEN

Soil organic matter (SOM) is related to vegetation, soil bacteria, and soil properties; however, not many studies link all these parameters simultaneously, particularly in tundra ecosystems vulnerable to climate change. Our aim was to describe the relationships between vegetation, bacteria, soil properties, and SOM composition in moist acidic tundra by integrating physical, chemical, and molecular methods. A total of 70 soil samples were collected at two different depths from 36 spots systematically arranged over an area of about 300 m × 50 m. Pyrolysis-gas chromatography/mass spectrometry and pyrosequencing of the 16S rRNA gene were used to identify the molecular compositions of the SOM and bacterial community, respectively. Vegetation and soil physicochemical properties were also measured. The sampling sites were grouped into three, based on their SOM compositions: Sphagnum moss-derived SOM, lipid-rich materials, and aromatic-rich materials. Our results show that SOM composition is spatially structured and linked to microtopography; however, the vegetation, soil properties, and bacterial community composition did not show overall spatial structuring. Simultaneously, soil properties and bacterial community composition were the main factors explaining SOM compositional variation, while vegetation had a residual effect. Verrucomicrobia and Acidobacteria were related to polysaccharides, and Chloroflexi was linked to aromatic compounds. These relationships were consistent across different hierarchical levels. Our results suggest that SOM composition at a local scale is closely linked with soil factors and the bacterial community. Comprehensive observation of ecosystem components is recommended to understand the in-situ function of bacteria and the fate of SOM in the moist acidic tundra.


Asunto(s)
Ecosistema , Suelo , Alaska , Bacterias/genética , ARN Ribosómico 16S/genética , Microbiología del Suelo , Tundra
8.
Ann Surg Treat Res ; 100(3): 144-153, 2021 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-33748028

RESUMEN

PURPOSE: Diagnostic biomarkers of pancreatic ductal adenocarcinoma (PDAC) have been used for early detection to reduce its dismal survival rate. However, clinically feasible biomarkers are still rare. Therefore, in this study, we developed an automated multi-marker enzyme-linked immunosorbent assay (ELISA) kit using 3 biomarkers (leucine-rich alpha-2-glycoprotein [LRG1], transthyretin [TTR], and CA 19-9) that were previously discovered and proposed a diagnostic model for PDAC based on this kit for clinical usage. METHODS: Individual LRG1, TTR, and CA 19-9 panels were combined into a single automated ELISA panel and tested on 728 plasma samples, including PDAC (n = 381) and normal samples (n = 347). The consistency between individual panels of 3 biomarkers and the automated multi-panel ELISA kit were accessed by correlation. The diagnostic model was developed using logistic regression according to the automated ELISA kit to predict the risk of pancreatic cancer (high-, intermediate-, and low-risk groups). RESULTS: The Pearson correlation coefficient of predicted values between the triple-marker automated ELISA panel and the former individual ELISA was 0.865. The proposed model provided reliable prediction results with a positive predictive value of 92.05%, negative predictive value of 90.69%, specificity of 90.69%, and sensitivity of 92.05%, which all simultaneously exceed 90% cutoff value. CONCLUSION: This diagnostic model based on the triple ELISA kit showed better diagnostic performance than previous markers for PDAC. In the future, it needs external validation to be used in the clinic.

9.
Behav Genet ; 51(3): 264-278, 2021 05.
Artículo en Inglés | MEDLINE | ID: mdl-33387133

RESUMEN

Offspring resemble their parents for both genetic and environmental reasons. Understanding the relative magnitude of these alternatives has long been a core interest in behavioral genetics research, but traditional designs, which compare phenotypic covariances to make inferences about unmeasured genetic and environmental factors, have struggled to disentangle them. Recently, Kong et al. (2018) showed that by correlating offspring phenotypic values with the measured polygenic score of parents' nontransmitted alleles, one can estimate the effect of "genetic nurture"-a type of passive gene-environment covariation that arises when heritable parental traits directly influence offspring traits. Here, we instantiate this basic idea in a set of causal models that provide novel insights into the estimation of parental influences on offspring. Most importantly, we show how jointly modeling the parental polygenic scores and the offspring phenotypes can provide an unbiased estimate of the variation attributable to the environmental influence of parents on offspring, even when the polygenic score accounts for a small fraction of trait heritability. This model can be further extended to (a) account for the influence of different types of assortative mating, (b) estimate the total variation due to additive genetic effects and their covariance with the familial environment (i.e., the full genetic nurture effect), and (c) model situations where a parental trait influences a different offspring trait. By utilizing structural equation modeling techniques developed for extended twin family designs, our approach provides a general framework for modeling polygenic scores in family studies and allows for various model extensions that can be used to answer old questions about familial influences in new ways.


Asunto(s)
Herencia Materna/genética , Herencia Paterna/genética , Estadística como Asunto/métodos , Alelos , Interacción Gen-Ambiente , Genotipo , Humanos , Modelos Genéticos , Modelos Teóricos , Herencia Multifactorial/genética , Relaciones Padres-Hijo , Padres/psicología , Fenotipo , Gemelos/genética
10.
Behav Genet ; 51(3): 279-288, 2021 05.
Artículo en Inglés | MEDLINE | ID: mdl-33301082

RESUMEN

In a companion paper Balbona et al. (Behav Genet, in press), we introduced a series of causal models that use polygenic scores from transmitted and nontransmitted alleles, the offspring trait, and parental traits to estimate the variation due to the environmental influences the parental trait has on the offspring trait (vertical transmission) as well as additive genetic effects. These models also estimate and account for the gene-gene and gene-environment covariation that arises from assortative mating and vertical transmission respectively. In the current study, we simulated polygenic scores and phenotypes of parents and offspring under genetic and vertical transmission scenarios, assuming two types of assortative mating. We instantiated the models from our companion paper in the OpenMx software, and compared the true values of parameters to maximum likelihood estimates from models fitted on the simulated data to quantify the bias and precision of estimates. We show that parameter estimates from these models are unbiased when assumptions are met, but as expected, they are biased to the degree that assumptions are unmet. Standard errors of the estimated variances due to vertical transmission and to genetic effects decrease with increasing sample sizes and with increasing [Formula: see text] values of the polygenic score. Even when the polygenic score explains a modest amount of trait variation ([Formula: see text]), standard errors of these standardized estimates are reasonable ([Formula: see text]) for [Formula: see text] trios, and can even be reasonable for smaller sample sizes (e.g., down to 4K) when the polygenic score is more predictive. These causal models offer a novel approach for understanding how parents influence their offspring, but their use requires polygenic scores on relevant traits that are modestly predictive (e.g., [Formula: see text] as well as datasets with genomic and phenotypic information on parents and offspring. The utility of polygenic scores for elucidating parental influences should thus serve as additional motivation for large genomic biobanks to perform GWAS's on traits that may be relevant to parenting and to oversample close relatives, particularly parents and offspring.


Asunto(s)
Herencia Materna/genética , Herencia Paterna/genética , Estadística como Asunto/métodos , Alelos , Sesgo , Interacción Gen-Ambiente , Estudio de Asociación del Genoma Completo , Genómica , Genotipo , Humanos , Funciones de Verosimilitud , Modelos Genéticos , Modelos Teóricos , Herencia Multifactorial/genética , Relaciones Padres-Hijo , Responsabilidad Parental , Fenotipo , Gemelos/genética
11.
Genes (Basel) ; 10(11)2019 11 14.
Artículo en Inglés | MEDLINE | ID: mdl-31739607

RESUMEN

Although there have been several analyses for identifying cancer-associated pathways, based on gene expression data, most of these are based on single pathway analyses, and thus do not consider correlations between pathways. In this paper, we propose a hierarchical structural component model for pathway analysis of gene expression data (HisCoM-PAGE), which accounts for the hierarchical structure of genes and pathways, as well as the correlations among pathways. Specifically, HisCoM-PAGE focuses on the survival phenotype and identifies its associated pathways. Moreover, its application to real biological data analysis of pancreatic cancer data demonstrated that HisCoM-PAGE could successfully identify pathways associated with pancreatic cancer prognosis. Simulation studies comparing the performance of HisCoM-PAGE with other competing methods such as Gene Set Enrichment Analysis (GSEA), Global Test, and Wald-type Test showed HisCoM-PAGE to have the highest power to detect causal pathways in most simulation scenarios.


Asunto(s)
Carcinoma Ductal Pancreático/genética , Análisis de Datos , Regulación Neoplásica de la Expresión Génica , Modelos Genéticos , Neoplasias Pancreáticas/genética , Anciano , Algoritmos , Carcinoma Ductal Pancreático/mortalidad , Simulación por Computador , Bases de Datos Genéticas/estadística & datos numéricos , Conjuntos de Datos como Asunto , Estudios de Factibilidad , Femenino , Redes Reguladoras de Genes , Humanos , Masculino , Persona de Mediana Edad , Análisis de Secuencia por Matrices de Oligonucleótidos/estadística & datos numéricos , Neoplasias Pancreáticas/mortalidad , Pronóstico , RNA-Seq/estadística & datos numéricos , República de Corea/epidemiología , Análisis de Supervivencia
12.
BMC Med Genomics ; 12(Suppl 5): 100, 2019 07 11.
Artículo en Inglés | MEDLINE | ID: mdl-31296220

RESUMEN

BACKGROUNDS: Recent large-scale genetic studies often involve clustered phenotypes such as repeated measurements. Compared to a series of univariate analyses of single phenotypes, an analysis of clustered phenotypes can be useful for substantially increasing statistical power to detect more genetic associations. Moreover, for the analysis of rare variants, incorporation of biological information can boost weak effects of the rare variants. RESULTS: Through simulation studies, we showed that the proposed method outperforms other method currently available for pathway-level analysis of clustered phenotypes. Moreover, a real data analysis using a large-scale whole exome sequencing dataset of 995 samples with metabolic syndrome-related phenotypes successfully identified the glyoxylate and dicarboxylate metabolism pathway that could not be identified by the univariate analyses of single phenotypes and other existing method. CONCLUSION: In this paper, we introduced a novel pathway-level association test by combining hierarchical structured components analysis and penalized generalized estimating equations. The proposed method analyzes all pathways in a single unified model while considering their correlations. C/C++ implementation of PHARAOH-GEE is publicly available at http://statgen.snu.ac.kr/software/pharaoh-gee/ .


Asunto(s)
Biología Computacional/métodos , Variación Genética , Fenotipo , Análisis por Conglomerados , Secuenciación del Exoma
13.
BMC Med Genomics ; 12(Suppl 5): 102, 2019 07 11.
Artículo en Inglés | MEDLINE | ID: mdl-31296221

RESUMEN

BACKGROUND: In genome-wide association studies (GWASs), meta-analysis has been widely used to improve statistical power by combining the results of different studies. Meta-analysis can detect phenotype associated variants that are failed to be detected in single studies. Especially, in biomedical sciences, meta-analysis is often necessary not only for improving statistical power, but also for reducing unavoidable limitation in data collection. As next-generation sequencing (NGS) technology has been developed, meta-analysis of rare variants is proceeding briskly along with meta-analysis of common variants in GWASs. However, meta-analysis on a single variant that is commonly used in common variant association test is improper for rare variants. A sparse signal of rare variant undermines the association signal and its large number causes multiple testing problem. To over-come these problems, we propose a meta-analysis method at the gene-level rather than variant level. RESULTS: Among many methods that have been developed, we used the unified quadratic tests (Q-tests); Q-test is more powerful than or as powerful as other tests such as Sequence Kernel Association Tests (SKAT). Since there are three different versions of Q-test (QTest1, QTest2, QTest3), each assumes different relationships among multiple rare variants, we extended them into meta-study accordingly. For meta-analysis, we consider two types of approaches, the one is to combine regression coefficients and the other is to combine test statistics from each single study. We extend the Q-test for meta-analysis, proposing Meta Quadratic Test (Meta-Qtest). Meta Q-test avoids the limitations of MetaSKAT. It does not only consider genetic heterogeneity among studies as MetaSKAT but also reflects diverse real situations; since we extend three different Q-tests into meta-analysis respectively, flexible Meta Q-test suggests way to deal with gene-level rare variant meta-analysis efficiently From the results of real data analysis of blood pressure trait, our meta-analysis could successfully discovered genes, KCNA5 and CABIN1 that are already well known for relevance with hypertension disease and they are not detected in MetaSKAT. CONCLUSION: As exemplified by an application to T2D Genes projects data set, Meta-Qtest more effectively identified genes associated with hypertension disease than MetaSKAT did.


Asunto(s)
Variación Genética , Metaanálisis como Asunto
14.
Genomics Inform ; 17(1): e10, 2019 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-30929411

RESUMEN

To identify miRNA-mRNA interaction pairs associated with binary phenotypes, we propose a hierarchical structural component model for miRNA-mRNA integration (HisCoM-mimi). Information on known mRNA targets provided by TargetScan is used to perform HisCoM-mimi. However, multiple databases can be used to find miRNA-mRNA signatures with known biological information through different algorithms. To take these additional databases into account, we present our advanced application software for HisCoM-mimi for binary phenotypes. The proposed HisCoM-mimi supports both TargetScan and miRTarBase, which provides manually-verified information initially gathered by text-mining the literature. By integrating information from miRTarBase into HisCoM-mimi, a broad range of target information derived from the research literature can be analyzed. Another improvement of the new HisCoM-mimi approach is the inclusion of updated algorithms to provide the lasso and elastic-net penalties for users who want to fit a model with a smaller number of selected miRNAs and mRNAs. We expect that our HisCoM-mimi software will make advanced methods accessible to researchers who want to identify miRNA-mRNA interaction pairs related with binary phenotypes.

15.
Psychol Med ; 49(13): 2177-2185, 2019 10.
Artículo en Inglés | MEDLINE | ID: mdl-30326977

RESUMEN

BACKGROUND: Given its diverse disease courses and symptom presentations, multiple phenotype dimensions with different biological underpinnings are expected with bipolar disorders (BPs). In this study, we aimed to identify lifetime BP psychopathology dimensions. We also explored the differing associations with bipolar I (BP-I) and bipolar II (BP-II) disorders. METHODS: We included a total of 307 subjects with BPs in the analysis. For the factor analysis, we chose six variables related to clinical courses, 29 indicators covering lifetime symptoms of mood episodes, and 6 specific comorbid conditions. To determine the relationships among the identified phenotypic dimensions and their effects on differentiating BP subtypes, we applied structural equation modeling. RESULTS: We selected a six-factor solution through scree plot, Velicer's minimum average partial test, and face validity evaluations; the six factors were cyclicity, depression, atypical vegetative symptoms, elation, psychotic/irritable mania, and comorbidity. In the path analysis, five factors excluding atypical vegetative symptoms were associated with one another. Cyclicity, depression, and comorbidity had positive associations, and they correlated negatively with psychotic/irritable mania; elation showed positive correlations with cyclicity and psychotic/irritable mania. Depression, cyclicity, and comorbidity were stronger in BP-II than in BP-I, and they contributed significantly to the distinction between the two disorders. CONCLUSIONS: We identified six phenotype dimensions; in addition to symptom features of manic and depressive episodes, various comorbidities and high cyclicity constructed separate dimensions. Except for atypical vegetative symptoms, all factors showed a complex interdependency and played roles in discriminating BP-II from BP-I.


Asunto(s)
Trastorno Bipolar/psicología , Depresión/psicología , Adulto , Anciano , Comorbilidad , Análisis Factorial , Femenino , Humanos , Masculino , Persona de Mediana Edad , Fenotipo , Psicopatología , República de Corea
16.
Methods Mol Biol ; 1882: 23-32, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-30378041

RESUMEN

A nomogram is a useful graphical tool for presenting a risk prediction and prognosis prediction in medical research. Intraductal papillary mucinous neoplasm (IPMN) is the premalignant lesions of the pancreas. Among the IPMN, branch duct (BD) IPMN is hard to determine whether progress to an invasive tumor or not. Surgery on the pancreas part is likely to lower the quality of life of the patient, so avoiding surgery to remove IPMN tissue of the patients with low risk should be carefully decided. In this study, we introduce the process of constructing a nomogram and illustrate it with a prediction model to predict malignancy of IPMN.


Asunto(s)
Nomogramas , Quiste Pancreático/diagnóstico , Neoplasias Intraductales Pancreáticas/diagnóstico , Adulto , Anciano , Anciano de 80 o más Años , Progresión de la Enfermedad , Femenino , Humanos , Masculino , Persona de Mediana Edad , Pancreatectomía/efectos adversos , Quiste Pancreático/patología , Quiste Pancreático/terapia , Neoplasias Intraductales Pancreáticas/patología , Neoplasias Intraductales Pancreáticas/terapia , Selección de Paciente , Calidad de Vida , Curva ROC
17.
J Bioinform Comput Biol ; 16(6): 1840026, 2018 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-30567476

RESUMEN

Although genome-wide association studies (GWAS) have successfully identified thousands of single nucleotide polymorphisms (SNPs) associated with common diseases, these observations are limited for fully explaining "missing heritability". Determining gene-gene interactions (GGI) are one possible avenue for addressing the missing heritability problem. While many statistical approaches have been proposed to detect GGI, most of these focus primarily on SNP-to-SNP interactions. While there are many advantages of gene-based GGI analyses, such as reducing the burden of multiple-testing correction, and increasing power by aggregating multiple causal signals across SNPs in specific genes, only a few methods are available. In this study, we proposed a new statistical approach for gene-based GGI analysis, "Hierarchical structural CoMponent analysis of Gene-Gene Interactions" (HisCoM-GGI). HisCoM-GGI is based on generalized structured component analysis, and can consider hierarchical structural relationships between genes and SNPs. For a pair of genes, HisCoM-GGI first effectively summarizes all possible pairwise SNP-SNP interactions into a latent variable, from which it then performs GGI analysis. HisCoM-GGI can evaluate both gene-level and SNP-level interactions. Through simulation studies, HisCoM-GGI demonstrated higher statistical power than existing gene-based GGI methods, in analyzing a GWAS of a Korean population for identifying GGI associated with body mass index. Resultantly, HisCoM-GGI successfully identified 14 potential GGI, two of which, (NCOR2 × SPOCK1) and (LINGO2 × ZNF385D) were successfully replicated in independent datasets. We conclude that HisCoM-GGI method may be a valuable tool for genome to identify GGI in missing heritability, allowing us to better understand the biological genetic mechanisms of complex traits. We conclude that HisCoM-GGI method may be a valuable tool for genome to identify GGI in missing heritability, allowing us to better understand biological genetic mechanisms of complex traits. An implementation of HisCoM-GGI can be downloaded from the website ( http://statgen.snu.ac.kr/software/hiscom-ggi ).


Asunto(s)
Índice de Masa Corporal , Epistasis Genética , Genómica/métodos , Polimorfismo de Nucleótido Simple , Pueblo Asiatico/genética , Bases de Datos Genéticas , Femenino , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Genómica/estadística & datos numéricos , Humanos , Masculino , Proteínas de la Membrana/genética , Modelos Genéticos , Proteínas del Tejido Nervioso/genética , Co-Represor 2 de Receptor Nuclear/genética , Proteoglicanos/genética , Factores de Transcripción/genética
18.
BioData Min ; 11: 27, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30564286

RESUMEN

BACKGROUND: One strategy for addressing missing heritability in genome-wide association study is gene-gene interaction analysis, which, unlike a single gene approach, involves high-dimensionality. The multifactor dimensionality reduction method (MDR) has been widely applied to reduce multi-levels of genotypes into high or low risk groups. The Cox-MDR method has been proposed to detect gene-gene interactions associated with the survival phenotype by using the martingale residuals from a Cox model. However, this method requires a cross-validation procedure to find the best SNP pair among all possible pairs and the permutation procedure should be followed for the significance of gene-gene interactions. Recently, the unified model based multifactor dimensionality reduction method (UM-MDR) has been proposed to unify the significance testing with the MDR algorithm within the regression model framework, in which neither cross-validation nor permutation testing are needed. In this paper, we proposed a simple approach, called Cox UM-MDR, which combines Cox-MDR with the key procedure of UM-MDR to identify gene-gene interactions associated with the survival phenotype. RESULTS: The simulation study was performed to compare Cox UM-MDR with Cox-MDR with and without the marginal effects of SNPs. We found that Cox UM-MDR has similar power to Cox-MDR without marginal effects, whereas it outperforms Cox-MDR with marginal effects and more robust to heavy censoring. We also applied Cox UM-MDR to a dataset of leukemia patients and detected gene-gene interactions with regard to the survival time. CONCLUSION: Cox UM-MDR is easily implemented by combining Cox-MDR with UM-MDR to detect the significant gene-gene interactions associated with the survival time without cross-validation and permutation testing. The simulation results are shown to demonstrate the utility of the proposed method, which achieves at least the same power as Cox-MDR in most scenarios, and outperforms Cox-MDR when some SNPs having only marginal effects might mask the detection of the causal epistasis.

19.
BMC Bioinformatics ; 19(Suppl 4): 75, 2018 05 08.
Artículo en Inglés | MEDLINE | ID: mdl-29745843

RESUMEN

BACKGROUND: Identification of multi-markers is one of the most challenging issues in personalized medicine era. Nowadays, many different types of omics data are generated from the same subject. Although many methods endeavor to identify candidate markers, for each type of omics data, few or none can facilitate such identification. RESULTS: It is well known that microRNAs affect phenotypes only indirectly, through regulating mRNA expression and/or protein translation. Toward addressing this issue, we suggest a hierarchical structured component analysis of microRNA-mRNA integration ("HisCoM-mimi") model that accounts for this biological relationship, to efficiently study and identify such integrated markers. In simulation studies, HisCoM-mimi showed the better performance than the other three methods. Also, in real data analysis, HisCoM-mimi successfully identified more gives more informative miRNA-mRNA integration sets relationships for pancreatic ductal adenocarcinoma (PDAC) diagnosis, compared to the other methods. CONCLUSION: As exemplified by an application to pancreatic cancer data, our proposed model effectively identified integrated miRNA/target mRNA pairs as markers for early diagnosis, providing a much broader biological interpretation.


Asunto(s)
MicroARNs/metabolismo , Modelos Genéticos , Algoritmos , Biomarcadores de Tumor/genética , Biomarcadores de Tumor/metabolismo , Carcinoma Ductal Pancreático/genética , Simulación por Computador , Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Humanos , MicroARNs/genética , Neoplasias Pancreáticas/genética , ARN Mensajero/genética , ARN Mensajero/metabolismo
20.
BMC Bioinformatics ; 19(Suppl 4): 79, 2018 05 08.
Artículo en Inglés | MEDLINE | ID: mdl-29745849

RESUMEN

BACKGROUND: As one possible solution to the "missing heritability" problem, many methods have been proposed that apply pathway-based analyses, using rare variants that are detected by next generation sequencing technology. However, while a number of methods for pathway-based rare-variant analysis of multiple phenotypes have been proposed, no method considers a unified model that incorporate multiple pathways. RESULTS: Simulation studies successfully demonstrated advantages of multivariate analysis, compared to univariate analysis, and comparison studies showed the proposed approach to outperform existing methods. Moreover, real data analysis of six type 2 diabetes-related traits, using large-scale whole exome sequencing data, identified significant pathways that were not found by univariate analysis. Furthermore, strong relationships between the identified pathways, and their associated metabolic disorder risk factors, were found via literature search, and one of the identified pathway, was successfully replicated by an analysis with an independent dataset. CONCLUSIONS: Herein, we present a powerful, pathway-based approach to investigate associations between multiple pathways and multiple phenotypes. By reflecting the natural hierarchy of biological behavior, and considering correlation between pathways and phenotypes, the proposed method is capable of analyzing multiple phenotypes and multiple pathways simultaneously.


Asunto(s)
Variación Genética , Transducción de Señal/genética , Algoritmos , Simulación por Computador , Bases de Datos Genéticas , Diabetes Mellitus Tipo 2/genética , Exoma/genética , Humanos , Modelos Genéticos , Análisis Multivariante , Fenotipo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA