Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 39
Filter
Add more filters

Publication year range
1.
Nature ; 606(7914): 527-534, 2022 06.
Article in English | MEDLINE | ID: mdl-35676474

ABSTRACT

Missing heritability in genome-wide association studies defines a major problem in genetic analyses of complex biological traits1,2. The solution to this problem is to identify all causal genetic variants and to measure their individual contributions3,4. Here we report a graph pangenome of tomato constructed by precisely cataloguing more than 19 million variants from 838 genomes, including 32 new reference-level genome assemblies. This graph pangenome was used for genome-wide association study analyses and heritability estimation of 20,323 gene-expression and metabolite traits. The average estimated trait heritability is 0.41 compared with 0.33 when using the single linear reference genome. This 24% increase in estimated heritability is largely due to resolving incomplete linkage disequilibrium through the inclusion of additional causal structural variants identified using the graph pangenome. Moreover, by resolving allelic and locus heterogeneity, structural variants improve the power to identify genetic factors underlying agronomically important traits leading to, for example, the identification of two new genes potentially contributing to soluble solid content. The newly identified structural variants will facilitate genetic improvement of tomato through both marker-assisted selection and genomic selection. Our study advances the understanding of the heritability of complex traits and demonstrates the power of the graph pangenome in crop breeding.


Subject(s)
Genetic Variation , Genome, Plant , Genome-Wide Association Study , Plant Breeding , Solanum lycopersicum , Alleles , Crops, Agricultural/genetics , Genome, Plant/genetics , Linkage Disequilibrium , Solanum lycopersicum/genetics , Solanum lycopersicum/metabolism
2.
Am J Hum Genet ; 111(4): 680-690, 2024 Apr 04.
Article in English | MEDLINE | ID: mdl-38490208

ABSTRACT

We propose TetraHer, a method for estimating the liability heritability of binary phenotypes. TetraHer has five key features. First, it can be applied to data from complex pedigrees that contain multiple types of relationships. Second, it can correct for ascertainment of cases or controls. Third, it can accommodate covariates. Fourth, it can model the contribution of common environment. Fifth, it produces a likelihood that can be used for significance testing. We first demonstrate the validity of TetraHer on simulated data. We then use TetraHer to estimate liability heritability for 229 codes from the tenth International Classification of Diseases (ICD-10). We identify 107 codes with significant heritability (p < 0.05/229), which can be used in future analyses for investigating the genetic architecture of human diseases.


Subject(s)
Genome-Wide Association Study , Models, Genetic , Humans , Pedigree , Phenotype , Polymorphism, Single Nucleotide
3.
Am J Hum Genet ; 110(1): 23-29, 2023 01 05.
Article in English | MEDLINE | ID: mdl-36480927

ABSTRACT

We present LDAK-GBAT, a tool for gene-based association testing using summary statistics from genome-wide association studies that is computationally efficient, produces well-calibrated p values, and is significantly more powerful than existing tools. LDAK-GBAT takes approximately 30 min to analyze imputed data (2.9M common, genic SNPs), requiring less than 10 Gb memory. It shows good control of type 1 error given an appropriate reference panel. Across 109 phenotypes (82 from the UK Biobank, 18 from the Million Veteran Program, and nine from the Psychiatric Genetics Consortium), LDAK-GBAT finds on average 19% (SE: 1%) more significant genes than the existing tool sumFREGAT-ACAT, with even greater gains in comparison with MAGMA, GCTA-fastBAT, sumFREGAT-SKAT-O, and sumFREGAT-PCA.


Subject(s)
Genetic Testing , Genome-Wide Association Study , Phenotype , Polymorphism, Single Nucleotide/genetics
4.
Ann Hum Genet ; 88(3): 212-246, 2024 May.
Article in English | MEDLINE | ID: mdl-38161273

ABSTRACT

OBJECTIVE: The genome-wide association studies (GWAS) analysis, the most successful technique for discovering disease-related genetic variation, has some statistical concerns, including multiple testing, the correlation among variants (single-nucleotide polymorphisms) based on linkage disequilibrium and omitting the important variants when fitting the model with just one variant. To eliminate these problems in a small sample-size study, we used a sparse Bayesian learning model for finding bipolar disorder (BD) genetic variants. METHODS: This study used the Wellcome Trust Case Control Consortium data set, including 1998 BD cases and 1500 control samples, and after quality control, 380,628 variants were analysed. In this GWAS, a Bayesian logistic model with hierarchical shrinkage spike and slab priors was used, with all variants considered simultaneously in one model. In order to decrease the computational burden, an alternative inferential method, Bayesian variational inference, has been used. RESULTS: Thirteen variants were selected as associated with BD. The three of them (rs7572953, rs1378850 and rs4148944) were reported in previous GWAS. Eight of which were related to hemogram parameters, such as lymphocyte percentage, plateletcrit and haemoglobin concentration. Among selected related genes, GABPA, ELF3 and JAM2 were enriched in the platelet-derived growth factor pathway. These three genes, along with APP, ARL8A, CDH23 and GPR37L1, could be differential diagnostic variants for BD. CONCLUSIONS: By reducing the statistical restrictions of GWAS analysis, the application of the Bayesian variational spike and slab models can offer insight into the genetic link with BD even with a small sample size. To uncover related variations with other traits, this model needs to be further examined.


Subject(s)
Bipolar Disorder , Genome-Wide Association Study , Humans , Genome-Wide Association Study/methods , Bipolar Disorder/genetics , Bipolar Disorder/metabolism , Bayes Theorem , Genetic Predisposition to Disease , Linkage Disequilibrium , Polymorphism, Single Nucleotide , Receptors, G-Protein-Coupled/genetics
5.
Bioessays ; 44(5): e2100170, 2022 05.
Article in English | MEDLINE | ID: mdl-35279859

ABSTRACT

Complex-trait genetics has advanced dramatically through methods to estimate the heritability tagged by SNPs, both genome-wide and in genomic regions of interest such as those defined by functional annotations. The models underlying many of these analyses are inadequate, and consequently many SNP-heritability results published to date are inaccurate. Here, we review the modelling issues, both for analyses based on individual genotype data and association test statistics, highlighting the role of a low-dimensional model for the heritability of each SNP. We use state-of-art models to present updated results about how heritability is distributed with respect to functional annotations in the human genome, and how it varies with allele frequency, which can reflect purifying selection. Our results give finer detail to the picture that has emerged in recent years of complex trait heritability widely dispersed across the genome. Confounding due to population structure remains a problem that summary statistic analyses cannot reliably overcome. Also see the video abstract here: https://youtu.be/WC2u03V65MQ.


Subject(s)
Genome-Wide Association Study , Polymorphism, Single Nucleotide , Gene Frequency , Genome, Human/genetics , Genome-Wide Association Study/methods , Genotype , Humans , Models, Genetic , Phenotype , Polymorphism, Single Nucleotide/genetics , Quantitative Trait, Heritable
6.
Bipolar Disord ; 25(1): 25-31, 2023 02.
Article in English | MEDLINE | ID: mdl-36377279

ABSTRACT

OBJECTIVES: Bipolar disorder is associated with increased body mass index (BMI), but it remains undetermined if this association is causal and, if so, in which direction it goes. Here, we sought to answer these questions using bidirectional two-sample Mendelian randomization, a method from genetic epidemiology that uses data from genome-wide association studies (GWAS) to examine whether a risk factor is causal for an outcome METHODS: We used summary statistics from GWAS of bipolar disorder and BMI conducted using data collected by the Psychiatric Genomics Consortium and the UK Biobank, respectively. The genetic instrument for bipolar disorder contained 53 SNPs and explained 0.5% of phenotypic variance, while the genetic instrument for BMI contained 517 SNPs and explained 7.1% of phenotypic variance RESULTS: Our findings suggest that genetic liability to bipolar disorder reduces BMI (slope from Egger regression = -0.195, p = 0.004). It follows that a twofold increase in the genetic liability to bipolar disorder leads to a 0.6 (kg/m2 ) reduction in BMI, predominantly driven by reduced fat mass. Conversely, we found no evidence that BMI causes changes in the risk of developing bipolar disorder CONCLUSION: The results of this study suggest that the increased BMI observed among individuals with bipolar disorder is not a direct consequence of genetic liability to bipolar disorder, but may more likely represent the sum of downstream correlates of manifest bipolar disorder, such as side effects of pharmacological treatment, poor diet, and sedentary lifestyle. As these factors are all modifiable, they can be targeted as part of clinical management.


Subject(s)
Bipolar Disorder , Humans , Genome-Wide Association Study , Body Mass Index , Mendelian Randomization Analysis , Risk Factors , Polymorphism, Single Nucleotide
7.
Acta Psychiatr Scand ; 148(5): 447-456, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37607129

ABSTRACT

OBJECTIVE: Mendelian randomization studies report a bi-directional relation between cigarette smoking and mental disorders, yet from a clinical standpoint, mental disorders are the focus of treatment. Here, we used an event history framework to understand their evolution in the life course. Our objective was to estimate the relative contribution of genetic predispositions and self-reported smoking status (never, former, and present smoker) to hospitalizations for major depression, bipolar disorder, and schizophrenia. METHODS: We calculated polygenic risk scores (PRS) for ever smoking, pack-years of smoking as a proportion of adult life, and neuroticism in 337,140 UK Biobank participants of white British ancestry. These PRS and self-reported smoking status were entered as explanatory variables in survival models for hospitalization. RESULTS: The estimated single nucleotide polymorphisms heritabilities (h2 ) were 23%, 5.7%, and 5.7% for pack-years, ever smoking, and neuroticism respectively. PRS pack-years and PRS neuroticism were associated with higher hospitalization risk for mental disorders in all smoking status groups. The hazard for mental health hospitalization was higher in both previous (HR: 1.50, CI: 1.35-1.67) and current (HR: 3.58, 2.97-4.31) compared to never smokers, after adjusting for confounders. CONCLUSION: Since genetic liabilities for smoking and neuroticism are fixed at conception and smoking initiation generally started before age 20, our results show that preventing smoking in adolescents probably prevents the development of mental disorders.

8.
Anim Genet ; 53(6): 872-877, 2022 Dec.
Article in English | MEDLINE | ID: mdl-36210489

ABSTRACT

Equine recurrent uveitis (ERU) is a blinding ocular disorder among horses, and the Appaloosa horse breed is disproportionally affected by a chronic form of this intraocular inflammatory disease known as insidious uveitis. Strong breed predisposition and previous investigations suggest that there is a genetic component to the pathology of insidious uveitis among Appaloosa horses; however, no estimates of the heritability of the disease have previously been determined. This study aimed to characterize the genetic underpinning of the disease by estimating the heritability for insidious uveitis among Appaloosas. After combining two genotyping array datasets from the Illumina Equine SNP70 BeadChip and the Axiom Equine 670 K Genotyping Array, heritability was estimated for 59 affected and 83 unaffected horses using both restricted maximum likelihood (REML) and phenotype correlation - genotype correlation solvers from the linkage disequilibrium adjusted kinship software. Based on previous research, age and sex were used as covariates, and the locus responsible for the characteristic Appaloosa coat pattern (LP), previously associated with ERU risk, was included as a fixed effect ('top predictor'). Using prevalence values from 0.05 to 0.42, the heritability estimate for insidious uveitis ranged from 0.95 (SE = 0.14) to 1.74 (SE = 0.25) with LP contributing 0.16-0.33 to the estimate. This study suggests that insidious uveitis is highly heritable (REML 95% CI, h2  = 0.68-1.0) and additional loci outside of LP are contributing to the genetic risk for insidious uveitis for Appaloosas. Once identified, these other genetic factors may lead to new disease mitigation efforts in veterinary care and breeding practices.


Subject(s)
Horse Diseases , Uveitis , Horses/genetics , Animals , Horse Diseases/genetics , Horse Diseases/epidemiology , Uveitis/genetics , Uveitis/veterinary , Genotype , Risk Factors
9.
Nat Rev Genet ; 16(1): 33-44, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25404112

ABSTRACT

Relatedness is a fundamental concept in genetics but is surprisingly hard to define in a rigorous yet useful way. Traditional relatedness coefficients specify expected genome sharing between individuals in pedigrees, but actual genome sharing can differ considerably from these expected values, which in any case vary according to the pedigree that happens to be available. Nowadays, we can measure genome sharing directly from genome-wide single-nucleotide polymorphism (SNP) data; however, there are many such measures in current use, and we lack good criteria for choosing among them. Here, we review SNP-based measures of relatedness and criteria for comparing them. We discuss how useful pedigree-based concepts remain today and highlight opportunities for further advances in quantitative genetics, with a focus on heritability estimation and phenotype prediction.


Subject(s)
Genetic Variation , Genetics, Population/methods , Models, Genetic , Pedigree , Phenotype , Polymorphism, Single Nucleotide/genetics , Computer Simulation , Humans
10.
Addict Biol ; 26(6): e13030, 2021 11.
Article in English | MEDLINE | ID: mdl-33733564

ABSTRACT

Cannabis use is associated with a number of psychiatric disorders; however, the causal nature of these associations has been difficult to establish. Mendelian randomization (MR) offers a way to infer causality between exposures with known genetic predictors (genome-wide significant single nucleotide polymorphisms [SNPs]) and outcomes of interest. MR has previously been applied to investigate the relationship between lifetime cannabis use (having ever used cannabis) and schizophrenia, depression, and attention deficit hyperactivity disorder (ADHD), but not bipolar disorder, representing a gap in the literature. We conducted a two-sample bidirectional MR study on the relationship between bipolar disorder and lifetime cannabis use. Genetic instruments (SNPs) were obtained from the summary statistics of recent large genome-wide association studies (GWAS). We conducted a two-sample bidirectional MR study on the relationship between bipolar disorder and lifetime cannabis use using inverse variance weighted regression, weighted median regression, and Egger regression. Genetic liability to bipolar disorder was significantly associated with an increased risk of lifetime cannabis use; however, genetic liability to lifetime cannabis use showed no association with the risk of bipolar disorder. The sensitivity analyses showed no evidence for pleiotropic effects. The present findings support a causal effect of liability to bipolar disorder on the risk of using cannabis at least once. No evidence was found for a causal effect of liability to cannabis use on the risk of bipolar disorder. These findings add important new knowledge to the understanding of the complex relationship between cannabis use and psychiatric disorders.


Subject(s)
Bipolar Disorder/epidemiology , Bipolar Disorder/genetics , Marijuana Abuse/epidemiology , Marijuana Abuse/genetics , Databases, Genetic , Humans , Mendelian Randomization Analysis , Polymorphism, Single Nucleotide , Regression Analysis
11.
Genet Epidemiol ; 43(8): 930-940, 2019 12.
Article in English | MEDLINE | ID: mdl-31541496

ABSTRACT

Linkage disequilibrium SCore regression (LDSC) has become a popular approach to estimate confounding bias, heritability, and genetic correlation using only genome-wide association study (GWAS) test statistics. SumHer is a newly introduced alternative with similar aims. We show using theory and simulations that both approaches fail to adequately account for confounding bias, even when the assumed heritability model is correct. Consequently, these methods may estimate heritability poorly if there was an inadequate adjustment for confounding in the original GWAS analysis. We also show that the choice of a summary statistic for use in LDSC or SumHer can have a large impact on resulting inferences. Further, covariate adjustments in the original GWAS can alter the target of heritability estimation, which can be problematic for test statistics from a meta-analysis of GWAS with different covariate adjustments.


Subject(s)
Bias , Data Interpretation, Statistical , Inheritance Patterns , Models, Genetic , Computer Simulation , Genome-Wide Association Study , Humans , Linkage Disequilibrium , Polymorphism, Single Nucleotide
12.
Heredity (Edinb) ; 124(6): 751-762, 2020 06.
Article in English | MEDLINE | ID: mdl-32273574

ABSTRACT

Estimating total narrow-sense heritability in admixed populations remains an open question. In this work, we used extensive simulations to evaluate existing linear mixed-model frameworks for estimating total narrow-sense heritability in two population-based cohorts from Greenland, and compared the results with data from unadmixed individuals from Denmark. When our analysis focused on Greenlandic sib pairs, and under the assumption that shared environment among siblings has a negligible effect, the model with two relationship matrices, one capturing identity by descent and one capturing identity by state, returned heritability estimates close to the true simulated value, while using each of the two matrices alone led to downward biases. When phenotypes correlated with ancestry, heritability estimates were inflated. Based on these observations, we propose a PCA-based adjustment that recovers the true simulated heritability. We use this knowledge to estimate the heritability of ten quantitative traits from the two Greenlandic cohorts, and report differences such as lower heritability for height in Greenlanders compared with Europeans. In conclusion, narrow-sense heritability in admixed populations is best estimated when using a mixture of genetic relationship matrices on individuals with at least one first-degree relative included in the sample.


Subject(s)
Genetics, Population , Models, Genetic , White People , Denmark , Greenland , Humans , Linear Models , Phenotype , Quantitative Trait, Heritable , White People/genetics
13.
Ann Neurol ; 84(2): 191-199, 2018 08.
Article in English | MEDLINE | ID: mdl-30014513

ABSTRACT

OBJECTIVE: Observational studies have shown that increased plasma urate is associated with lower risk of Parkinson's disease (PD), but these studies were not designed to test causality. If a causal relationship exists, then modulating plasma urate levels could be a potential preventive avenue for PD. We used a large two-sample Mendelian randomization (MR) design to assess for a causal relationship between plasma urate and PD risk. METHODS: We used a genetic instrument consisting of 31 independent loci for plasma urate on a case-control genome-wide association study data set, which included 13,708 PD cases and 95,282 controls. Individual effect estimates for each SNP were combined using the inverse-variance weighted (IVW) method. Two additional methods, MR-Egger and a penalized weighted median (PWM)-based approach, were used to assess potential bias attributed to pleiotropy or invalid instruments. RESULTS: We found no evidence for a causal relationship between urate and PD, with an effect estimate from the IVW method of odds ratio (OR) 1.03 (95% confidence interval [CI], 0.88-1.20) per 1-standard-deviation increase in plasma urate levels. MR Egger and PWM analyses yielded similar estimates (OR, 0.99 [95% CI, 0.83-1.17] and 0.99 [95% CI, 0.86-1.14], respectively). INTERPRETATION: We did not find evidence for a linear causal protective effect by urate on PD risk. The associations observed in previous observational studies may be, in part, attributed to confounding or reverse causality. In the context of the present findings, strategies to elevate circulating urate levels may not reduce overall PD risk. Ann Neurol 2018;84:191-199.


Subject(s)
Genetic Variation/genetics , Mendelian Randomization Analysis/methods , Parkinson Disease/blood , Parkinson Disease/genetics , Polymorphism, Single Nucleotide/genetics , Uric Acid/blood , Biomarkers/blood , Databases, Genetic/trends , Humans , Parkinson Disease/diagnosis
14.
Nature ; 486(7403): 346-52, 2012 Apr 18.
Article in English | MEDLINE | ID: mdl-22522925

ABSTRACT

The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in ~40% of genes, with the landscape dominated by cis- and trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA­RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the 'CNA-devoid' subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome.


Subject(s)
Breast Neoplasms/genetics , Breast Neoplasms/pathology , DNA Copy Number Variations/genetics , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Genome, Human/genetics , Breast Neoplasms/classification , Breast Neoplasms/diagnosis , Female , Gene Regulatory Networks/genetics , Genes, Neoplasm/genetics , Genomics , Humans , Kaplan-Meier Estimate , MAP Kinase Kinase 4/genetics , Polymorphism, Single Nucleotide/genetics , Prognosis , Protein Phosphatase 2/genetics , Treatment Outcome
15.
Am J Med Genet B Neuropsychiatr Genet ; 177(7): 641-657, 2018 10.
Article in English | MEDLINE | ID: mdl-30325587

ABSTRACT

Individuals with psychiatric disorders have elevated rates of autoimmune comorbidity and altered immune signaling. It is unclear whether these altered immunological states have a shared genetic basis with those psychiatric disorders. The present study sought to use existing summary-level data from previous genome-wide association studies to determine if commonly varying single nucleotide polymorphisms are shared between psychiatric and immune-related phenotypes. We estimated heritability and examined pair-wise genetic correlations using the linkage disequilibrium score regression (LDSC) and heritability estimation from summary statistics methods. Using LDSC, we observed significant genetic correlations between immune-related disorders and several psychiatric disorders, including anorexia nervosa, attention deficit-hyperactivity disorder, bipolar disorder, major depression, obsessive compulsive disorder, schizophrenia, smoking behavior, and Tourette syndrome. Loci significantly mediating genetic correlations were identified for schizophrenia when analytically paired with Crohn's disease, primary biliary cirrhosis, systemic lupus erythematosus, and ulcerative colitis. We report significantly correlated loci and highlight those containing genome-wide associations and candidate genes for respective disorders. We also used the LDSC method to characterize genetic correlations among the immune-related phenotypes. We discuss our findings in the context of relevant genetic and epidemiological literature, as well as the limitations and caveats of the study.


Subject(s)
Autoimmune Diseases/genetics , Mental Disorders/genetics , Autoimmune Diseases/physiopathology , Comorbidity , Databases, Factual , Female , Genetic Predisposition to Disease , Genome-Wide Association Study/methods , Humans , Linkage Disequilibrium , Male , Mental Disorders/physiopathology , Multifactorial Inheritance , Polymorphism, Single Nucleotide , White People/genetics
16.
Gastroenterology ; 151(4): 698-709, 2016 10.
Article in English | MEDLINE | ID: mdl-27373512

ABSTRACT

BACKGROUND & AIMS: Crohn's disease (CD) is a highly heritable disease that is particularly common in the Ashkenazi Jewish population. We studied 2 large Ashkenazi Jewish families with a high prevalence of CD in an attempt to identify novel genetic risk variants. METHODS: Ashkenazi Jewish patients with CD and a positive family history were recruited from the University College London Hospital. We used genome-wide, single-nucleotide polymorphism data to assess the burden of common CD-associated risk variants and for linkage analysis. Exome sequencing was performed and rare variants that were predicted to be deleterious and were observed at a high frequency in cases were prioritized. We undertook within-family association analysis after imputation and assessed candidate variants for evidence of association with CD in an independent cohort of Ashkenazi Jewish individuals. We examined the effects of a variant in DUOX2 on hydrogen peroxide production in HEK293 cells. RESULTS: We identified 2 families (1 with >800 members and 1 with >200 members) containing 54 and 26 cases of CD or colitis, respectively. Both families had a significant enrichment of previously described common CD-associated risk variants. No genome-wide significant linkage was observed. Exome sequencing identified candidate variants, including a missense mutation in DUOX2 that impaired its function and a frameshift mutation in CSF2RB that was associated with CD in an independent cohort of Ashkenazi Jewish individuals. CONCLUSIONS: In a study of 2 large Ashkenazi Jewish with multiple cases of CD, we found the genetic basis of the disease to be complex, with a role for common and rare genetic variants. We identified a frameshift mutation in CSF2RB that was replicated in an independent cohort. These findings show the value of family studies and the importance of the innate immune system in the pathogenesis of CD.


Subject(s)
Crohn Disease/genetics , Cytokine Receptor Common beta Subunit/genetics , Jews/genetics , NADPH Oxidases/genetics , Pedigree , Adolescent , Age of Onset , Crohn Disease/ethnology , Dual Oxidases , Exome , Female , Frameshift Mutation , Genetic Linkage , Genetic Predisposition to Disease , HEK293 Cells/metabolism , Humans , Male , Molecular Sequence Data , Mutation, Missense , Polymorphism, Single Nucleotide , Risk Factors , Young Adult
17.
Genome Res ; 24(9): 1550-7, 2014 Sep.
Article in English | MEDLINE | ID: mdl-24963154

ABSTRACT

BLUP (best linear unbiased prediction) is widely used to predict complex traits in plant and animal breeding, and increasingly in human genetics. The BLUP mathematical model, which consists of a single random effect term, was adequate when kinships were measured from pedigrees. However, when genome-wide SNPs are used to measure kinships, the BLUP model implicitly assumes that all SNPs have the same effect-size distribution, which is a severe and unnecessary limitation. We propose MultiBLUP, which extends the BLUP model to include multiple random effects, allowing greatly improved prediction when the random effects correspond to classes of SNPs with distinct effect-size variances. The SNP classes can be specified in advance, for example, based on SNP functional annotations, and we also provide an adaptive procedure for determining a suitable partition of SNPs. We apply MultiBLUP to genome-wide association data from the Wellcome Trust Case Control Consortium (seven diseases), and from much larger studies of celiac disease and inflammatory bowel disease, finding that it consistently provides better prediction than alternative methods. Moreover, MultiBLUP is computationally very efficient; for the largest data set, which includes 12,678 individuals and 1.5 M SNPs, the total analysis can be run on a single desktop PC in less than a day and can be parallelized to run even faster. Tools to perform MultiBLUP are freely available in our software LDAK.


Subject(s)
Algorithms , Models, Genetic , Polymorphism, Single Nucleotide , Quantitative Trait, Heritable , Animals , Humans , Mice
18.
Hum Mol Genet ; 23(1): 247-58, 2014 Jan 01.
Article in English | MEDLINE | ID: mdl-23962720

ABSTRACT

We present the analysis of a prospective multicentre study to investigate genetic effects on the prognosis of newly treated epilepsy. Patients with a new clinical diagnosis of epilepsy requiring medication were recruited and followed up prospectively. The clinical outcome was defined as freedom from seizures for a minimum of 12 months in accordance with the consensus statement from the International League Against Epilepsy (ILAE). Genetic effects on remission of seizures after starting treatment were analysed with and without adjustment for significant clinical prognostic factors, and the results from each cohort were combined using a fixed-effects meta-analysis. After quality control (QC), we analysed 889 newly treated epilepsy patients using 472 450 genotyped and 6.9 × 10(6) imputed single-nucleotide polymorphisms. Suggestive evidence for association (defined as Pmeta < 5.0 × 10(-7)) with remission of seizures after starting treatment was observed at three loci: 6p12.2 (rs492146, Pmeta = 2.1 × 10(-7), OR[G] = 0.57), 9p23 (rs72700966, Pmeta = 3.1 × 10(-7), OR[C] = 2.70) and 15q13.2 (rs143536437, Pmeta = 3.2 × 10(-7), OR[C] = 1.92). Genes of biological interest at these loci include PTPRD and ARHGAP11B (encoding functions implicated in neuronal development) and GSTA4 (a phase II biotransformation enzyme). Pathway analysis using two independent methods implicated a number of pathways in the prognosis of epilepsy, including KEGG categories 'calcium signaling pathway' and 'phosphatidylinositol signaling pathway'. Through a series of power curves, we conclude that it is unlikely any single common variant explains >4.4% of the variation in the outcome of newly treated epilepsy.


Subject(s)
Epilepsy/diagnosis , Epilepsy/genetics , Genome-Wide Association Study , Adult , Anticonvulsants/therapeutic use , Calcium Signaling/genetics , Chromosomes, Human, Pair 15 , Chromosomes, Human, Pair 6 , Chromosomes, Human, Pair 9 , Epilepsy/drug therapy , Female , Genetic Predisposition to Disease , Genetic Variation , Humans , Male , Middle Aged , Phosphatidylinositols/genetics , Polymorphism, Single Nucleotide , Prognosis , Prospective Studies , Treatment Outcome , Young Adult
19.
Am J Hum Genet ; 91(6): 1011-21, 2012 Dec 07.
Article in English | MEDLINE | ID: mdl-23217325

ABSTRACT

Estimation of narrow-sense heritability, h(2), from genome-wide SNPs genotyped in unrelated individuals has recently attracted interest and offers several advantages over traditional pedigree-based methods. With the use of this approach, it has been estimated that over half the heritability of human height can be attributed to the ~300,000 SNPs on a genome-wide genotyping array. In comparison, only 5%-10% can be explained by SNPs reaching genome-wide significance. We investigated via simulation the validity of several key assumptions underpinning the mixed-model analysis used in SNP-based h(2) estimation. Although we found that the method is reasonably robust to violations of four key assumptions, it can be highly sensitive to uneven linkage disequilibrium (LD) between SNPs: contributions to h(2) are overestimated from causal variants in regions of high LD and are underestimated in regions of low LD. The overall direction of the bias can be up or down depending on the genetic architecture of the trait, but it can be substantial in realistic scenarios. We propose a modified kinship matrix in which SNPs are weighted according to local LD. We show that this correction greatly reduces the bias and increases the precision of h(2) estimates. We demonstrate the impact of our method on the first seven diseases studied by the Wellcome Trust Case Control Consortium. Our LD adjustment revises downward the h(2) estimate for immune-related diseases, as expected because of high LD in the major-histocompatibility region, but increases it for some nonimmune diseases. To calculate our revised kinship matrix, we developed LDAK, software for computing LD-adjusted kinships.


Subject(s)
Genome-Wide Association Study , Models, Genetic , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide , Algorithms , Computer Simulation , Genome, Human , Genotype , Humans , Linkage Disequilibrium , Pedigree
20.
Brain ; 137(Pt 10): 2680-9, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25063994

ABSTRACT

Epilepsy is a disease with substantial missing heritability; despite its high genetic component, genetic association studies have had limited success detecting common variants which influence susceptibility. In this paper, we reassess the role of common variants on epilepsy using extensions of heritability analysis. Our data set consists of 1258 UK patients with epilepsy, of which 958 have focal epilepsy, and 5129 population control subjects, with genotypes recorded for over 4 million common single nucleotide polymorphisms. Firstly, we show that on the liability scale, common variants collectively explain at least 26% (standard deviation 5%) of phenotypic variation for all epilepsy and 27% (standard deviation 5%) for focal epilepsy. Secondly we provide a new method for estimating the number of causal variants for complex traits; when applied to epilepsy, our most optimistic estimate suggests that at least 400 variants influence disease susceptibility, with potentially many thousands. Thirdly, we use bivariate analysis to assess how similar the genetic architecture of focal epilepsy is to that of non-focal epilepsy; we demonstrate both significant differences (P = 0.004) and significant similarities (P = 0.01) between the two subtypes, indicating that although the clinical definition of focal epilepsy does identify a genetically distinct epilepsy subtype, there is also scope to improve the classification of epilepsy by incorporating genotypic information. Lastly, we investigate the potential value in using genetic data to diagnose epilepsy following a single epileptic seizure; we find that a prediction model explaining 10% of phenotypic variation could have clinical utility for deciding which single-seizure individuals are likely to benefit from immediate anti-epileptic drug therapy.


Subject(s)
Epilepsy/genetics , Age of Onset , Algorithms , Area Under Curve , Asian People , Epilepsy/physiopathology , Genetic Loci , Genetic Predisposition to Disease , Genome-Wide Association Study , Genotype , Genotyping Techniques , Humans , Models, Statistical , Polymorphism, Single Nucleotide , Population , Quantitative Trait, Heritable , ROC Curve , Seizures/genetics , Seizures/physiopathology , White People
SELECTION OF CITATIONS
SEARCH DETAIL