RESUMEN
Biobanks aim to improve our understanding of health and disease by collecting and analysing diverse biological and phenotypic information in large samples. So far, biobanks have largely pursued a population-based sampling strategy, where the individual is the unit of sampling, and familial relatedness occurs sporadically and by chance. This strategy has been remarkably efficient and successful, leading to thousands of scientific discoveries across multiple research domains, and plans for the next wave of biobanks are underway. In this Perspective, we discuss the strengths and limitations of a complementary sampling strategy for future biobanks based on oversampling of close genetic relatives. Such family-based samples facilitate research that clarifies causal relationships between putative risk factors and outcomes, particularly in estimates of genetic effects, because they enable analyses that reduce or eliminate confounding due to familial and demographic factors. Family-based biobank samples would also shed new light on fundamental questions across multiple fields that are often difficult to explore in population-based samples. Despite the potential for higher costs and greater analytical complexity, the many advantages of family-based samples should often outweigh their potential challenges.
Asunto(s)
Bancos de Muestras Biológicas , Familia , HumanosRESUMEN
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.
Asunto(s)
Estatura , Mapeo Cromosómico , Polimorfismo de Nucleótido Simple , Humanos , Estatura/genética , Frecuencia de los Genes/genética , Genoma Humano/genética , Estudio de Asociación del Genoma Completo , Haplotipos/genética , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Europa (Continente)/etnología , Tamaño de la Muestra , FenotipoRESUMEN
It has been 15 years since the advent of the genome-wide association study (GWAS) era. Here, we review how this experimental design has realized its promise by facilitating an impressive range of discoveries with remarkable impact on multiple fields, including population genetics, complex trait genetics, epidemiology, social science, and medicine. We predict that the emergence of large-scale biobanks will continue to expand to more diverse populations and capture more of the allele frequency spectrum through whole-genome sequencing, which will further improve our ability to investigate the causes and consequences of human genetic variation for complex traits and diseases.
Asunto(s)
Genética de Población , Estudio de Asociación del Genoma Completo , Humanos , Frecuencia de los Genes , Herencia Multifactorial , Polimorfismo de Nucleótido SimpleRESUMEN
Vitamin D status-a complex trait influenced by environmental and genetic factors-is tightly associated with skin colour and ancestry. Yet very few studies have investigated the genetic underpinnings of vitamin D levels across diverse ancestries, and the ones that have, relied on small sample sizes, resulting in inconclusive results. Here, we conduct genome-wide association studies (GWAS) of 25 hydroxyvitamin D (25OHD)-the main circulating form of vitamin D-in 442,435 individuals from four broad genetically-determined ancestry groups represented in the UK Biobank: European (N = 421,867), South Asian (N = 9,983), African (N = 8,306) and East Asian (N = 2,279). We identify a new genetic determinant of 25OHD (rs146759773) in individuals of African ancestry, which was not detected in previous analysis of much larger European cohorts due to low minor allele frequency. We show genome-wide significant evidence of dominance effects in 25OHD that protect against vitamin D deficiency. Given that key events in the synthesis of 25OHD occur in the skin and are affected by pigmentation levels, we conduct GWAS of 25OHD stratified by skin colour and identify new associations. Lastly, we test the interaction between skin colour and variants associated with variance in 25OHD levels and identify two loci (rs10832254 and rs1352846) whose association with 25OHD differs in individuals of distinct complexions. Collectively, our results provide new insights into the complex relationship between 25OHD and skin colour and highlight the importance of diversity in genomic studies. Despite the much larger rates of vitamin D deficiency that we and others report for ancestry groups with dark skin (e.g., South Asian), our study highlights the importance of considering ancestral background and/or skin colour when assessing the implications of low vitamin D.
Asunto(s)
Estudio de Asociación del Genoma Completo , Deficiencia de Vitamina D , Humanos , Polimorfismo de Nucleótido Simple/genética , Vitamina D/genética , Deficiencia de Vitamina D/genéticaRESUMEN
Findings from genome-wide association studies have facilitated the generation of genetic predictors for many common human phenotypes. Stratifying individuals misaligned to a genetic predictor based on common variants may be important for follow-up studies that aim to identify alternative causal factors. Using genome-wide imputed genetic data, we aimed to classify 158,951 unrelated individuals from the UK Biobank as either concordant or deviating from two well-measured phenotypes. We first applied our methods to standing height: our primary analysis classified 244 individuals (0.15%) as misaligned to their genetically predicted height. We show that these individuals are enriched for self-reporting being shorter or taller than average at age 10, diagnosed congenital malformations, and rare loss-of-function variants in genes previously catalogued as causal for growth disorders. Secondly, we apply our methods to LDL cholesterol (LDL-C). We classified 156 (0.12%) individuals as misaligned to their genetically predicted LDL-C and show that these individuals were enriched for both clinically actionable cardiovascular risk factors and rare genetic variants in genes previously shown to be involved in metabolic processes. Individuals whose LDL-C was higher than expected based on the genetic predictor were also at higher risk of developing coronary artery disease and type-two diabetes, even after adjustment for measured LDL-C, BMI and age, suggesting upward deviation from genetically predicted LDL-C is indicative of generally poor health. Our results remained broadly consistent when performing sensitivity analysis based on a variety of parametric and non-parametric methods to define individuals deviating from polygenic expectation. Our analyses demonstrate the potential importance of quantitatively identifying individuals for further follow-up based on deviation from genetic predictions.
Asunto(s)
Enfermedad de la Arteria Coronaria , Estudio de Asociación del Genoma Completo , Humanos , Niño , LDL-Colesterol/genética , Fenotipo , Enfermedad de la Arteria Coronaria/genética , Estudios de Seguimiento , Análisis de la Aleatorización Mendeliana , Factores de Riesgo , Polimorfismo de Nucleótido SimpleRESUMEN
Across species, offspring of related individuals often exhibit significant reduction in fitness-related traits, known as inbreeding depression (ID), yet the genetic and molecular basis for ID remains elusive. Here, we develop a method to quantify enrichment of ID within specific genomic annotations and apply it to human data. We analyzed the phenomes and genomes of â¼350,000 unrelated participants of the UK Biobank and found, on average of over 11 traits, significant enrichment of ID within genomic regions with high recombination rates (>21-fold; p < 10-5), with conserved function across species (>19-fold; p < 10-4), and within regulatory elements such as DNase I hypersensitive sites (â¼5-fold; p = 8.9 × 10-7). We also quantified enrichment of ID within trait-associated regions and found suggestive evidence that genomic regions contributing to additive genetic variance in the population are enriched for ID signal. We find strong correlations between functional enrichment of SNP-based heritability and that of ID (r = 0.8, standard error: 0.1). These findings provide empirical evidence that ID is most likely due to many partially recessive deleterious alleles in low linkage disequilibrium regions of the genome. Our study suggests that functional characterization of ID may further elucidate the genetic architectures and biological mechanisms underlying complex traits and diseases.
Asunto(s)
Estudio de Asociación del Genoma Completo , Genómica/métodos , Depresión Endogámica/genética , Desequilibrio de Ligamiento , Herencia Multifactorial/genética , Fenotipo , Polimorfismo de Nucleótido Simple , Femenino , Humanos , MasculinoRESUMEN
Non-additive genetic variance for complex traits is traditionally estimated from data on relatives. It is notoriously difficult to estimate without bias in non-laboratory species, including humans, because of possible confounding with environmental covariance among relatives. In principle, non-additive variance attributable to common DNA variants can be estimated from a random sample of unrelated individuals with genome-wide SNP data. Here, we jointly estimate the proportion of variance explained by additive (hSNP2), dominance (δSNP2) and additive-by-additive (ηSNP2) genetic variance in a single analysis model. We first show by simulations that our model leads to unbiased estimates and provide a new theory to predict standard errors estimated using either least-squares or maximum likelihood. We then apply the model to 70 complex traits using 254,679 unrelated individuals from the UK Biobank and 1.1 M genotyped and imputed SNPs. We found strong evidence for additive variance (average across traits h¯SNP2=0.208). In contrast, the average estimate of δ¯SNP2 across traits was 0.001, implying negligible dominance variance at causal variants tagged by common SNPs. The average epistatic variance η¯SNP2 across the traits was 0.055, not significantly different from zero because of the large sampling variance. Our results provide new evidence that genetic variance for complex traits is predominantly additive and that sample sizes of many millions of unrelated individuals are needed to estimate epistatic variance with sufficient precision.
Asunto(s)
Conjuntos de Datos como Asunto , Herencia Multifactorial/genética , Polimorfismo de Nucleótido Simple/genética , Bancos de Muestras Biológicas , Epistasis Genética , Femenino , Genotipo , Humanos , Masculino , Modelos Genéticos , Fenotipo , Reproducibilidad de los Resultados , Reino UnidoRESUMEN
STUDY QUESTION: Which genetic factors regulate female propensity for giving birth to spontaneous dizygotic (DZ) twins? SUMMARY ANSWER: We identified four new loci, GNRH1, FSHR, ZFPM1, and IPO8, in addition to previously identified loci, FSHB and SMAD3. WHAT IS KNOWN ALREADY: The propensity to give birth to DZ twins runs in families. Earlier, we reported that FSHB and SMAD3 as associated with DZ twinning and female fertility measures. STUDY DESIGN, SIZE, DURATION: We conducted a genome-wide association meta-analysis (GWAMA) of mothers of spontaneous dizygotic (DZ) twins (8265 cases, 264â567 controls) and of independent DZ twin offspring (26â252 cases, 417â433 controls). PARTICIPANTS/MATERIALS, SETTING, METHODS: Over 700â000 mothers of DZ twins, twin individuals and singletons from large cohorts in Australia/New Zealand, Europe, and the USA were carefully screened to exclude twins born after use of ARTs. Genetic association analyses by cohort were followed by meta-analysis, phenome wide association studies (PheWAS), in silico and in vivo annotations, and Zebrafish functional validation. MAIN RESULTS AND THE ROLE OF CHANCE: This study enlarges the sample size considerably from previous efforts, finding four genome-wide significant loci, including two novel signals and a further two novel genes that are implicated by gene level enrichment analyses. The novel loci, GNRH1 and FSHR, have well-established roles in female reproduction whereas ZFPM1 and IPO8 have not previously been implicated in female fertility. We found significant genetic correlations with multiple aspects of female reproduction and body size as well as evidence for significant selection against DZ twinning during human evolution. The 26 top single nucleotide polymorphisms (SNPs) from our GWAMA in European-origin participants weakly predicted the crude twinning rates in 47 non-European populations (r = 0.23 between risk score and population prevalence, s.e. 0.11, 1-tail P = 0.058) indicating that genome-wide association studies (GWAS) are needed in African and Asian populations to explore the causes of their respectively high and low DZ twinning rates. In vivo functional tests in zebrafish for IPO8 validated its essential role in female, but not male, fertility. In most regions, risk SNPs linked to known expression quantitative trait loci (eQTLs). Top SNPs were associated with in vivo reproductive hormone levels with the top pathways including hormone ligand binding receptors and the ovulation cycle. LARGE SCALE DATA: The full DZT GWAS summary statistics will made available after publication through the GWAS catalog (https://www.ebi.ac.uk/gwas/). LIMITATIONS, REASONS FOR CAUTION: Our study only included European ancestry cohorts. Inclusion of data from Africa (with the highest twining rate) and Asia (with the lowest rate) would illuminate further the biology of twinning and female fertility. WIDER IMPLICATIONS OF THE FINDINGS: About one in 40 babies born in the world is a twin and there is much speculation on why twinning runs in families. We hope our results will inform investigations of ovarian response in new and existing ARTs and the causes of female infertility. STUDY FUNDING/COMPETING INTEREST(S): Support for the Netherlands Twin Register came from the Netherlands Organization for Scientific Research (NWO) and The Netherlands Organization for Health Research and Development (ZonMW) grants, 904-61-193, 480-04-004, 400-05-717, Addiction-31160008, 911-09-032, Biobanking and Biomolecular Resources Research Infrastructure (BBMRI.NL, 184.021.007), Royal Netherlands Academy of Science Professor Award (PAH/6635) to DIB, European Research Council (ERC-230374), Rutgers University Cell and DNA Repository (NIMH U24 MH068457-06), the Avera Institute, Sioux Falls, South Dakota (USA) and the National Institutes of Health (NIH R01 HD042157-01A1) and the Genetic Association Information Network (GAIN) of the Foundation for the National Institutes of Health and Grand Opportunity grants 1RC2 MH089951. The QIMR Berghofer Medical Research Institute (QIMR) study was supported by grants from the National Health and Medical Research Council (NHMRC) of Australia (241944, 339462, 389927, 389875, 389891, 389892, 389938, 443036, 442915, 442981, 496610, 496739, 552485, 552498, 1050208, 1075175). L.Y. is funded by Australian Research Council (Grant number DE200100425). The Minnesota Center for Twin and Family Research (MCTFR) was supported in part by USPHS Grants from the National Institute on Alcohol Abuse and Alcoholism (AA09367 and AA11886) and the National Institute on Drug Abuse (DA05147, DA13240, and DA024417). The Women's Genome Health Study (WGHS) was funded by the National Heart, Lung, and Blood Institute (HL043851 and HL080467) and the National Cancer Institute (CA047988 and UM1CA182913), with support for genotyping provided by Amgen. Data collection in the Finnish Twin Registry has been supported by the Wellcome Trust Sanger Institute, the Broad Institute, ENGAGE-European Network for Genetic and Genomic Epidemiology, FP7-HEALTH-F4-2007, grant agreement number 201413, National Institute of Alcohol Abuse and Alcoholism (grants AA-12502, AA-00145, AA-09203, AA15416, and K02AA018755) and the Academy of Finland (grants 100499, 205585, 118555, 141054, 264146, 308248, 312073 and 336823 to J. Kaprio). TwinsUK is funded by the Wellcome Trust, Medical Research Council, Versus Arthritis, European Union Horizon 2020, Chronic Disease Research Foundation (CDRF), Zoe Ltd and the National Institute for Health Research (NIHR) Clinical Research Network (CRN) and Biomedical Research Centre based at Guy's and St Thomas' NHS Foundation Trust in partnership with King's College London. For NESDA, funding was obtained from the Netherlands Organization for Scientific Research (Geestkracht program grant 10000-1002), the Center for Medical Systems Biology (CSMB, NVVO Genomics), Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-NL), VU University's Institutes for Health and Care Research (EMGO+) and Neuroscience Campus Amsterdam, University Medical Center Groningen, Leiden University Medical Center, National Institutes of Health (NIH, ROI D0042157-01A, MH081802, Grand Opportunity grants 1 RC2 Ml-1089951 and IRC2 MH089995). Part of the genotyping and analyses were funded by the Genetic Association Information Network (GAIN) of the Foundation for the National Institutes of Health. Computing was supported by BiG Grid, the Dutch e-Science Grid, which is financially supported by NWO. Work in the Del Bene lab was supported by the Programme Investissements d'Avenir IHU FOReSIGHT (ANR-18-IAHU-01). C.R. was supported by an EU Horizon 2020 Marie Sklodowska-Curie Action fellowship (H2020-MSCA-IF-2014 #661527). H.S. and K.S. are employees of deCODE Genetics/Amgen. The other authors declare no competing financial interests. TRIAL REGISTRATION NUMBER: N/A.
Asunto(s)
Fertilidad , Estudio de Asociación del Genoma Completo , Gemelación Dicigótica , Animales , Femenino , Humanos , Embarazo , Proteínas Portadoras/genética , Fertilidad/genética , Hormonas , Proteínas/genética , Estados Unidos , Pez Cebra/genéticaRESUMEN
Samples can be prone to ascertainment and attrition biases. The Australian Genetics of Depression Study is a large publicly recruited cohort (n = 20,689) established to increase the understanding of depression and antidepressant treatment response. This study investigates differences between participants who donated a saliva sample or agreed to linkage of their records compared to those who did not. We observed that older, male participants with higher education were more likely to donate a saliva sample. Self-reported bipolar disorder, ADHD, panic disorder, PTSD, substance use disorder, and social anxiety disorder were associated with lower odds of donating a saliva sample, whereas anorexia was associated with higher odds of donation. Male and younger participants showed higher odds of agreeing to record linkage. Participants with higher neuroticism scores and those with a history of bipolar disorder were also more likely to agree to record linkage whereas participants with a diagnosis of anorexia were less likely to agree. Increased likelihood of consent was associated with increased genetic susceptibility to anorexia and reduced genetic risk for depression, and schizophrenia. Overall, our results show moderate differences among these subsamples. Most current epidemiological studies do not search for attrition biases at the genetic level. The possibility to do so is a strength of samples such as the AGDS. Our results suggest that analyses can be made more robust by identifying attrition biases both on the phenotypic and genetic level, and either contextualising them as a potential limitation or performing sensitivity analyses adjusting for them.
Asunto(s)
Anorexia , Depresión , Humanos , Masculino , Depresión/epidemiología , Depresión/genética , Australia , Consentimiento Informado , ADNRESUMEN
Genome-Wide Association Studies (GWAS) in large human cohorts have identified thousands of loci associated with complex traits and diseases. For identifying the genes and gene-associated variants that underlie complex traits in livestock, especially where sample sizes are limiting, it may help to integrate the results of GWAS for equivalent traits in humans as prior information. In this study, we sought to investigate the usefulness of results from a GWAS on human height as prior information for identifying the genes and gene-associated variants that affect stature in cattle, using GWAS summary data on samples sizes of 700,000 and 58,265 for humans and cattle, respectively. Using Fisher's exact test, we observed a significant proportion of cattle stature-associated genes (30/77) that are also associated with human height (odds ratio = 5.1, p = 3.1e-10). Result of randomized sampling tests showed that cattle orthologs of human height-associated genes, hereafter referred to as candidate genes (C-genes), were more enriched for cattle stature GWAS signals than random samples of genes in the cattle genome (p = 0.01). Randomly sampled SNPs within the C-genes also tend to explain more genetic variance for cattle stature (up to 13.2%) than randomly sampled SNPs within random cattle genes (p = 0.09). The most significant SNPs from a cattle GWAS for stature within the C-genes did not explain more genetic variance for cattle stature than the most significant SNPs within random cattle genes (p = 0.87). Altogether, our findings support previous studies that suggest a similarity in the genetic regulation of height across mammalian species. However, with the availability of a powerful GWAS for stature that combined data from 8 cattle breeds, prior information from human-height GWAS does not seem to provide any additional benefit with respect to the identification of genes and gene-associated variants that affect stature in cattle.
Asunto(s)
Estatura/genética , Bovinos/genética , Estudio de Asociación del Genoma Completo/métodos , Animales , Cruzamiento/métodos , Bases de Datos Genéticas , Variación Genética/genética , Humanos , Ganado/genética , Herencia Multifactorial/genética , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genéticaRESUMEN
The genetic background of childhood body mass index (BMI), and the extent to which the well-known associations of childhood BMI with adult diseases are explained by shared genetic factors, are largely unknown. We performed a genome-wide association study meta-analysis of BMI in 61,111 children aged between 2 and 10 years. Twenty-five independent loci reached genome-wide significance in the combined discovery and replication analyses. Two of these, located near NEDD4L and SLC45A3, have not previously been reported in relation to either childhood or adult BMI. Positive genetic correlations of childhood BMI with birth weight and adult BMI, waist-to-hip ratio, diastolic blood pressure and type 2 diabetes were detected (Rg ranging from 0.11 to 0.76, P-values <0.002). A negative genetic correlation of childhood BMI with age at menarche was observed. Our results suggest that the biological processes underlying childhood BMI largely, but not completely, overlap with those underlying adult BMI. The well-known observational associations of BMI in childhood with cardio-metabolic diseases in adulthood may reflect partial genetic overlap, but in light of previous evidence, it is also likely that they are explained through phenotypic continuity of BMI from childhood into adulthood.
Asunto(s)
Enfermedades Cardiovasculares/genética , Diabetes Mellitus Tipo 2/genética , Predisposición Genética a la Enfermedad , Proteínas de Transporte de Monosacáridos/genética , Ubiquitina-Proteína Ligasas Nedd4/genética , Adolescente , Adulto , Presión Sanguínea , Índice de Masa Corporal , Factores de Riesgo Cardiometabólico , Enfermedades Cardiovasculares/patología , Niño , Preescolar , Diabetes Mellitus Tipo 2/patología , Femenino , Estudio de Asociación del Genoma Completo/métodos , Humanos , Masculino , Menarquia/genética , Análisis de la Aleatorización Mendeliana , Relación Cintura-CaderaRESUMEN
We report genome-wide ancient DNA from 44 ancient Near Easterners ranging in time between ~12,000 and 1,400 bc, from Natufian hunter-gatherers to Bronze Age farmers. We show that the earliest populations of the Near East derived around half their ancestry from a 'Basal Eurasian' lineage that had little if any Neanderthal admixture and that separated from other non-African lineages before their separation from each other. The first farmers of the southern Levant (Israel and Jordan) and Zagros Mountains (Iran) were strongly genetically differentiated, and each descended from local hunter-gatherers. By the time of the Bronze Age, these two populations and Anatolian-related farmers had mixed with each other and with the hunter-gatherers of Europe to greatly reduce genetic differentiation. The impact of the Near Eastern farmers extended beyond the Near East: farmers related to those of Anatolia spread westward into Europe; farmers related to those of the Levant spread southward into East Africa; farmers related to those of Iran spread northward into the Eurasian steppe; and people related to both the early farmers of Iran and to the pastoralists of the Eurasian steppe spread eastward into South Asia.
Asunto(s)
Agricultura/historia , Genómica , Migración Humana/historia , Filogenia , Grupos Raciales/genética , África Oriental , Animales , Armenia , Asia , ADN/análisis , Europa (Continente) , Historia Antigua , Humanos , Hibridación Genética/genética , Irán , Israel , Jordania , Hombre de Neandertal/genética , Filogeografía , TurquíaRESUMEN
More than one in three adults worldwide is either overweight or obese. Epidemiological studies indicate that the location and distribution of excess fat, rather than general adiposity, are more informative for predicting risk of obesity sequelae, including cardiometabolic disease and cancer. We performed a genome-wide association study meta-analysis of body fat distribution, measured by waist-to-hip ratio (WHR) adjusted for body mass index (WHRadjBMI), and identified 463 signals in 346 loci. Heritability and variant effects were generally stronger in women than men, and we found approximately one-third of all signals to be sexually dimorphic. The 5% of individuals carrying the most WHRadjBMI-increasing alleles were 1.62 times more likely than the bottom 5% to have a WHR above the thresholds used for metabolic syndrome. These data, made publicly available, will inform the biology of body fat distribution and its relationship with disease.
Asunto(s)
Adiposidad/genética , Distribución de la Grasa Corporal/métodos , Obesidad/genética , Tejido Adiposo/fisiología , Adulto , Alelos , Índice de Masa Corporal , Femenino , Frecuencia de los Genes/genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Humanos , Masculino , Polimorfismo de Nucleótido Simple/genética , Relación Cintura-Cadera , Población Blanca/genéticaRESUMEN
This study investigates if genetic factors could contribute to the high rate of mood disorders reported in a U.S. community known to have a restricted early founder population (confirmed here through runs of homozygosity analysis). Polygenic scores (PGSs) for eight common diseases, disorders, or traits, including psychiatric disorders, were calculated in 274 participants (125 mood disorder cases) who each reported three or four grandparents born in the community. Ancestry-matched controls were selected from the UK Biobank (UKB; three sets of N = 1,822 each). The mean PGSs were significantly higher in the community for major depression PRS (p = 2.1 × 10-19 , 0.56 SD units), bipolar disorder (p = 2.5 × 10-15 , 0.56 SD units), and schizophrenia (p = 3.8 × 10-21 , 0.64 SD units). The PGSs were not significantly different between the community participants and UKB controls for the traits of body mass index, Type 2 diabetes, coronary artery disease, and chronotype. The mean PGSs for height were significantly lower in the community sample compared to controls (-0.21 SD units, p = 1.2 × 10-5 ). The results are consistent with enrichment of polygenic risk factors for psychiatric disorders in this community.
Asunto(s)
Trastorno Bipolar , Trastorno Depresivo Mayor , Diabetes Mellitus Tipo 2 , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Herencia Multifactorial/genéticaRESUMEN
Recent genome-wide association studies (GWAS) of height and body mass index (BMI) in â¼250000 European participants have led to the discovery of â¼700 and â¼100 nearly independent single nucleotide polymorphisms (SNPs) associated with these traits, respectively. Here we combine summary statistics from those two studies with GWAS of height and BMI performed in â¼450000 UK Biobank participants of European ancestry. Overall, our combined GWAS meta-analysis reaches N â¼700000 individuals and substantially increases the number of GWAS signals associated with these traits. We identified 3290 and 941 near-independent SNPs associated with height and BMI, respectively (at a revised genome-wide significance threshold of P < 1 × 10-8), including 1185 height-associated SNPs and 751 BMI-associated SNPs located within loci not previously identified by these two GWAS. The near-independent genome-wide significant SNPs explain â¼24.6% of the variance of height and â¼6.0% of the variance of BMI in an independent sample from the Health and Retirement Study (HRS). Correlations between polygenic scores based upon these SNPs with actual height and BMI in HRS participants were â¼0.44 and â¼0.22, respectively. From analyses of integrating GWAS and expression quantitative trait loci (eQTL) data by summary-data-based Mendelian randomization, we identified an enrichment of eQTLs among lead height and BMI signals, prioritizing 610 and 138 genes, respectively. Our study demonstrates that, as previously predicted, increasing GWAS sample sizes continues to deliver, by the discovery of new loci, increasing prediction accuracy and providing additional data to achieve deeper insight into complex trait biology. All summary statistics are made available for follow-up studies.
Asunto(s)
Estatura/genética , Índice de Masa Corporal , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Peso Corporal/genética , Femenino , Genoma Humano , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Población Blanca/genéticaRESUMEN
The recent availability of large-scale neuroimaging cohorts facilitates deeper characterisation of the relationship between phenotypic and brain architecture variation in humans. Here, we investigate the association (previously coined morphometricity) of a phenotype with all 652,283 vertex-wise measures of cortical and subcortical morphology in a large data set from the UK Biobank (UKB; N = 9,497 for discovery, N = 4,323 for replication) and the Human Connectome Project (N = 1,110). We used a linear mixed model with the brain measures of individuals fitted as random effects with covariance relationships estimated from the imaging data. We tested 167 behavioural, cognitive, psychiatric or lifestyle phenotypes and found significant morphometricity for 58 phenotypes (spanning substance use, blood assay results, education or income level, diet, depression, and cognition domains), 23 of which replicated in the UKB replication set or the HCP. We then extended the model for a bivariate analysis to estimate grey-matter correlation between phenotypes, which revealed that body size (i.e., height, weight, BMI, waist and hip circumference, body fat percentage) could account for a substantial proportion of the morphometricity (confirmed using a conditional analysis), providing possible insight into previous MRI case-control results for psychiatric disorders where case status is associated with body mass index. Our LMM framework also allowed to predict some of the associated phenotypes from the vertex-wise measures, in two independent samples. Finally, we demonstrated additional new applications of our approach (a) region of interest (ROI) analysis that retain the vertex-wise complexity; (b) comparison of the information retained by different MRI processings.
Asunto(s)
Tamaño Corporal/fisiología , Sustancia Gris/anatomía & histología , Sustancia Gris/diagnóstico por imagen , Neuroimagen/métodos , Fenotipo , Factores de Edad , Anciano , Anciano de 80 o más Años , Conectoma , Bases de Datos Factuales , Femenino , Humanos , Imagen por Resonancia Magnética , Masculino , Persona de Mediana Edad , Factores SexualesRESUMEN
Using data from 5500 adolescents from the National Longitudinal Study of Adolescent to Adult Health, Domingue et al. (Proc Natl Acad Sci 25:256., 2018) claimed to show that friends are genetically more similar to one another than randomly selected peers, beyond the confounding effects of population stratification by ancestry. The authors also claimed to show 'social-genetic' effects, whereby individuals' educational attainment (EA) is influenced by their friends' genes. We argue that neither claim is justified by the data. Mathematically we show that (1) the genetic similarity reported between friends is far larger than theoretically possible if it was caused by phenotypic assortment as the authors claim; uncontrolled population stratification is a likely reason for the genetic similarity they observed, and (2) significant association between individuals' EA and their friends' polygenic scores for EA is a necessary consequence of EA similarity among friends, and does not provide evidence for social-genetic effects. Going forward, we urge caution in the analysis and interpretation of data at the intersection of human genetics and the social sciences.
Asunto(s)
Conducta del Adolescente/psicología , Amigos/psicología , Genotipo , Adolescente , Escolaridad , Femenino , Humanos , Masculino , Modelos Teóricos , Grupo Paritario , Polimorfismo de Nucleótido Simple/genética , Conducta Social , Medio Social , Integración Social , Ciencias Sociales/métodos , Ciencias Sociales/tendenciasRESUMEN
Quantifying the effects of inbreeding is critical to characterizing the genetic architecture of complex traits. This study highlights through theory and simulations the strengths and shortcomings of three SNP-based inbreeding measures commonly used to estimate inbreeding depression (ID). We demonstrate that heterogeneity in linkage disequilibrium (LD) between causal variants and SNPs biases ID estimates, and we develop an approach to correct this bias using LD and minor allele frequency stratified inference (LDMS). We quantified ID in 25 traits measured in [Formula: see text] participants of the UK Biobank, using LDMS, and confirmed previously published ID for 4 traits. We find unique evidence of ID for handgrip strength, waist/hip ratio, and visual and auditory acuity (ID between -2.3 and -5.2 phenotypic SDs for complete inbreeding; [Formula: see text]). Our results illustrate that a careful choice of the measure of inbreeding combined with LDMS stratification improves both detection and quantification of ID using SNP data.