RESUMO
The impact of rare recurrent copy number variants (rCNVs) and polygenic background attributed to common variants, on the risk of psychiatric disorders is well-established in separate studies. However, it remains unclear how polygenic background modulates the effect of rCNVs. Using the population-representative iPSYCH2015 case-cohort sample (N=96,599), we investigated the association between absolute risk of psychiatric disorders and carriage of rCNVs and polygenic scores (PGS), as well as the interaction effect between the two on disease risk. Carriers of rCNVs with higher gene constraint scores had an increased absolute risk for autism, ADHD, and schizophrenia, but not depression, whereas an increase in PGS for each respective disorder was associated with higher absolute risk across all four disorders. Similarly, elevated absolute risks were observed with the increase of both PGS and gene constraints of rCNVs except in the case of depression. In contrast to some previous case-control studies, our joint analysis of rCNV groups and PGS revealed no indication of significant interactive effect between these two factors on disease risk. Also, we found no significant interactions of PGS with any of the most common individual rCNVs, except in the case of 16p13.11 duplication, which was found to attenuate the effect of ADHD-PGS on the absolute risk of ADHD. This study advances our understanding of the interplay between rare and common important genetic risk factors for major psychiatric disorders and sheds light on the importance of population-based samples in implementing precision medicine.
RESUMO
Emerging evidence has shown that assortative mating (AM) is a key factor that shapes the landscape of complex human traits. It can increase the overall prevalence of disorders, influence occurrences of comorbidities, and bias estimation of genetic architectures. However, there is lack of large-scale studies to examine the cultural differences and the generational trends of AM for psychiatric disorders. Here, using national registry datasets, we conduct the largest scale of AM analyses on nine psychiatric disorders, with up to 1.4 million mated cases and 6 million matched controls. We performed meta-analyses on AM estimates from Taiwan, Denmark, and Sweden, to examine the potential impact of cultural differences. Generational changes for people born after 1930s were investigated as well. We found that AM of psychiatric disorders are consistent across nations and persistent over generations, with a small proportion of disorders showing generational changes of AM. Our results provide additional insight into the mechanisms of AM across psychiatric disorders and have evident implications on the estimation of the genetic architectures of psychiatric disorders.
RESUMO
Importance: Recurrent copy number variants (rCNVs) have been associated with increased risk of psychiatric disorders in case-control studies, but their population-level impact is unknown. Objective: To provide unbiased population-based estimates of prevalence and risk associated with psychiatric disorders for rCNVs and to compare risks across outcomes, rCNV dosage type (deletions or duplications), and locus features. Design, Setting, and Participants: This genetic association study is an analysis of data from the Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH) case-cohort sample of individuals born in Denmark in 1981-2008 and followed up until 2015, including (1) all individuals (n = 92â¯531) with a hospital discharge diagnosis of attention-deficit/hyperactivity disorder (ADHD), autism spectrum disorder (ASD), bipolar disorder, major depressive disorder (MDD), or schizophrenia spectrum disorder (SSD) and (2) a subcohort (n = 50â¯625) randomly drawn from the source population. Data were analyzed from January 2021 to August 2023. Exposures: Carrier status of deletions and duplications at 27 autosomal rCNV loci was determined from neonatal blood samples genotyped on single-nucleotide variant microarrays. Main Outcomes and Measures: Population-based rCNV prevalence was estimated with a survey model using finite population correction to account for oversampling of cases. Hazard ratio (HR) estimates and 95% CIs for psychiatric disorders were derived using weighted Cox proportional hazard models. Risks were compared across outcomes, dosage type, and locus features using generalized estimating equation models. Results: A total of 3547 rCNVs were identified in 64â¯735 individuals assigned male at birth (53.8%) and 55â¯512 individuals assigned female at birth (46.2%) whose age at the end of follow-up ranged from 7.0 to 34.7 years (mean, 21.8 years). Most observed increases in rCNV-associated risk for ADHD, ASD, or SSD were moderate, and risk estimates were highly correlated across these disorders. Notable exceptions included high ASD-associated risk observed for Prader-Willi/Angelman syndrome duplications (HR, 20.8; 95% CI, 7.9-55). No rCNV was associated with increased MDD risk. Also, rCNV-associated risk was positively correlated with locus size and gene constraint but not with dosage type. Comparison with published case-control and community-based studies revealed a higher prevalence of deletions and lower associated increase in risk for several rCNVs in iPSYCH2015. Conclusions and Relevance: This study found that several rCNVs were more prevalent and conferred less risk of psychiatric disorders than estimated previously. Most case-control studies overestimate rCNV-associated risk of psychiatric disorders, likely because of selection bias. In an era where genetics is increasingly being clinically applied, these results highlight the importance of population-based risk estimates for genetics-based predictions.
Assuntos
Transtorno do Deficit de Atenção com Hiperatividade , Transtorno do Espectro Autista , Variações do Número de Cópias de DNA , Transtorno Depressivo Maior , Humanos , Variações do Número de Cópias de DNA/genética , Masculino , Feminino , Transtorno do Espectro Autista/genética , Transtorno do Espectro Autista/epidemiologia , Transtorno do Deficit de Atenção com Hiperatividade/genética , Transtorno do Deficit de Atenção com Hiperatividade/epidemiologia , Dinamarca/epidemiologia , Transtorno Depressivo Maior/genética , Transtorno Depressivo Maior/epidemiologia , Transtornos Mentais/genética , Transtornos Mentais/epidemiologia , Esquizofrenia/genética , Esquizofrenia/epidemiologia , Transtorno Bipolar/genética , Transtorno Bipolar/epidemiologia , Adulto , Predisposição Genética para Doença/genética , Prevalência , Criança , Estudos de Casos e Controles , Adolescente , Estudos de Associação Genética , Estudos de Coortes , Polimorfismo de Nucleotídeo Único , Adulto JovemRESUMO
Large-scale genome-wide association studies (GWAS) strongly suggest that most traits and diseases have a polygenic component. This observation has motivated the development of disease-specific "polygenic scores (PGS)" that are weighted sums of the effects of disease-associated variants identified from GWAS that correlate with an individual's likelihood of expressing a specific phenotype. Although most GWAS have been pursued on disease traits, leading to the creation of refined "Polygenic Risk Scores" (PRS) that quantify risk to diseases, many GWAS have also been pursued on extreme human longevity, general fitness, health span, and other health-positive traits. These GWAS have discovered many genetic variants seemingly protective from disease and are often different from disease-associated variants (i.e., they are not just alternative alleles at disease-associated loci) and suggest that many health-positive traits also have a polygenic basis. This observation has led to an interest in "polygenic longevity scores (PLS)" that quantify the "risk" or genetic predisposition of an individual towards health. We derived 11 different PLS from 4 different available GWAS on lifespan and then investigated the properties of these PLS using data from the UK Biobank (UKB). Tests of association between the PLS and population structure, parental lifespan, and several cancerous and non-cancerous diseases, including death from COVID-19, were performed. Based on the results of our analyses, we argue that PLS are made up of variants not only robustly associated with parental lifespan, but that also contribute to the genetic architecture of disease susceptibility, morbidity, and mortality.
Assuntos
Bancos de Espécimes Biológicos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Longevidade , Herança Multifatorial , Humanos , Longevidade/genética , Herança Multifatorial/genética , Reino Unido/epidemiologia , Feminino , Masculino , COVID-19/genética , COVID-19/epidemiologia , Pais , Idoso , Biobanco do Reino UnidoRESUMO
Major migration events in Holocene Eurasia have been characterized genetically at broad regional scales1-4. However, insights into the population dynamics in the contact zones are hampered by a lack of ancient genomic data sampled at high spatiotemporal resolution5-7. Here, to address this, we analysed shotgun-sequenced genomes from 100 skeletons spanning 7,300 years of the Mesolithic period, Neolithic period and Early Bronze Age in Denmark and integrated these with proxies for diet (13C and 15N content), mobility (87Sr/86Sr ratio) and vegetation cover (pollen). We observe that Danish Mesolithic individuals of the Maglemose, Kongemose and Ertebølle cultures form a distinct genetic cluster related to other Western European hunter-gatherers. Despite shifts in material culture they displayed genetic homogeneity from around 10,500 to 5,900 calibrated years before present, when Neolithic farmers with Anatolian-derived ancestry arrived. Although the Neolithic transition was delayed by more than a millennium relative to Central Europe, it was very abrupt and resulted in a population turnover with limited genetic contribution from local hunter-gatherers. The succeeding Neolithic population, associated with the Funnel Beaker culture, persisted for only about 1,000 years before immigrants with eastern Steppe-derived ancestry arrived. This second and equally rapid population replacement gave rise to the Single Grave culture with an ancestry profile more similar to present-day Danes. In our multiproxy dataset, these major demographic events are manifested as parallel shifts in genotype, phenotype, diet and land use.
Assuntos
Genoma Humano , Genômica , Migração Humana , Populações Escandinavas e Nórdicas , Humanos , Dinamarca/etnologia , Emigrantes e Imigrantes/história , Genótipo , Populações Escandinavas e Nórdicas/genética , Populações Escandinavas e Nórdicas/história , Migração Humana/história , Genoma Humano/genética , História Antiga , Pólen , Dieta/história , Caça/história , Fazendeiros/história , Cultura , Fenótipo , Conjuntos de Dados como AssuntoRESUMO
Western Eurasia witnessed several large-scale human migrations during the Holocene1-5. Here, to investigate the cross-continental effects of these migrations, we shotgun-sequenced 317 genomes-mainly from the Mesolithic and Neolithic periods-from across northern and western Eurasia. These were imputed alongside published data to obtain diploid genotypes from more than 1,600 ancient humans. Our analyses revealed a 'great divide' genomic boundary extending from the Black Sea to the Baltic. Mesolithic hunter-gatherers were highly genetically differentiated east and west of this zone, and the effect of the neolithization was equally disparate. Large-scale ancestry shifts occurred in the west as farming was introduced, including near-total replacement of hunter-gatherers in many areas, whereas no substantial ancestry shifts happened east of the zone during the same period. Similarly, relatedness decreased in the west from the Neolithic transition onwards, whereas, east of the Urals, relatedness remained high until around 4,000 BP, consistent with the persistence of localized groups of hunter-gatherers. The boundary dissolved when Yamnaya-related ancestry spread across western Eurasia around 5,000 BP, resulting in a second major turnover that reached most parts of Europe within a 1,000-year span. The genetic origin and fate of the Yamnaya have remained elusive, but we show that hunter-gatherers from the Middle Don region contributed ancestry to them. Yamnaya groups later admixed with individuals associated with the Globular Amphora culture before expanding into Europe. Similar turnovers occurred in western Siberia, where we report new genomic data from a 'Neolithic steppe' cline spanning the Siberian forest steppe to Lake Baikal. These prehistoric migrations had profound and lasting effects on the genetic diversity of Eurasian populations.
Assuntos
Genética Populacional , Genoma Humano , Migração Humana , Metagenômica , Humanos , Agricultura/história , Ásia Ocidental , Mar Negro , Diploide , Europa (Continente)/etnologia , Genótipo , História Antiga , Migração Humana/história , Caça/história , Camada de GeloRESUMO
The Holocene (beginning around 12,000 years ago) encompassed some of the most significant changes in human evolution, with far-reaching consequences for the dietary, physical and mental health of present-day populations. Using a dataset of more than 1,600 imputed ancient genomes1, we modelled the selection landscape during the transition from hunting and gathering, to farming and pastoralism across West Eurasia. We identify key selection signals related to metabolism, including that selection at the FADS cluster began earlier than previously reported and that selection near the LCT locus predates the emergence of the lactase persistence allele by thousands of years. We also find strong selection in the HLA region, possibly due to increased exposure to pathogens during the Bronze Age. Using ancient individuals to infer local ancestry tracts in over 400,000 samples from the UK Biobank, we identify widespread differences in the distribution of Mesolithic, Neolithic and Bronze Age ancestries across Eurasia. By calculating ancestry-specific polygenic risk scores, we show that height differences between Northern and Southern Europe are associated with differential Steppe ancestry, rather than selection, and that risk alleles for mood-related phenotypes are enriched for Neolithic farmer ancestry, whereas risk alleles for diabetes and Alzheimer's disease are enriched for Western hunter-gatherer ancestry. Our results indicate that ancient selection and migration were large contributors to the distribution of phenotypic diversity in present-day Europeans.
Assuntos
Asiático , População Europeia , Genoma Humano , Seleção Genética , Humanos , Afeto , Agricultura/história , Alelos , Doença de Alzheimer/genética , Ásia/etnologia , Asiático/genética , Diabetes Mellitus/genética , Europa (Continente)/etnologia , População Europeia/genética , Fazendeiros/história , Loci Gênicos/genética , Predisposição Genética para Doença , Genoma Humano/genética , História Antiga , Migração Humana , Caça/história , Família Multigênica/genética , Fenótipo , Biobanco do Reino Unido , Herança Multifatorial/genéticaRESUMO
Attention deficit hyperactivity disorder (ADHD) is a complex disorder that manifests variability in long-term outcomes and clinical presentations. The genetic contributions to such heterogeneity are not well understood. Here we show several genetic links to clinical heterogeneity in ADHD in a case-only study of 14,084 diagnosed individuals. First, we identify one genome-wide significant locus by comparing cases with ADHD and autism spectrum disorder (ASD) to cases with ADHD but not ASD. Second, we show that cases with ASD and ADHD, substance use disorder and ADHD, or first diagnosed with ADHD in adulthood have unique polygenic score (PGS) profiles that distinguish them from complementary case subgroups and controls. Finally, a PGS for an ASD diagnosis in ADHD cases predicted cognitive performance in an independent developmental cohort. Our approach uncovered evidence of genetic heterogeneity in ADHD, helping us to understand its etiology and providing a model for studies of other disorders.
Assuntos
Transtorno do Deficit de Atenção com Hiperatividade , Transtorno do Espectro Autista , Humanos , Transtorno do Espectro Autista/genética , Transtorno do Deficit de Atenção com Hiperatividade/genética , Herança Multifatorial/genéticaRESUMO
BACKGROUND: Symptoms of major depressive disorder (MDD) are commonly assessed using self-rating instruments like the Patient Health Questionnaire-9 (PHQ-9) (current symptoms) and the Composite International Diagnostic Interview Short-Form (CIDI-SF) (worst-episode symptoms). We performed a systematic comparison between them for their genetic architecture and utility in investigating MDD heterogeneity. METHODS: Using data from the UK Biobank (n = 41,948-109,417), we assessed the single nucleotide polymorphism heritability and genetic correlation (rg) of both sets of MDD symptoms. We further compared their rg with non-MDD traits and used Mendelian randomization to assess whether either set of symptoms has more genetic sharing with non-MDD traits. We also assessed how specific each set of symptoms is to MDD using the metric polygenic risk score pleiotropy. Finally, we used genomic structural equation modeling to identify factors that explain the genetic covariance between each set of symptoms. RESULTS: Corresponding symptoms reported through the PHQ-9 and CIDI-SF have low to moderate genetic correlations (rg = 0.43-0.87), and this cannot be fully attributed to different severity thresholds or the use of a skip structure in the CIDI-SF. Both Mendelian randomization and polygenic risk score pleiotropy analyses showed that PHQ-9 symptoms are more associated with traits that reflect general dysphoria, whereas the skip structure in the CIDI-SF allows for the identification of heterogeneity among likely MDD cases. Finally, the 2 sets of symptoms showed different factor structures in genomic structural equation modeling, reflective of their genetic differences. CONCLUSIONS: MDD symptoms assessed using the PHQ-9 and CIDI-SF are not interchangeable; the former better indexes general dysphoria, while the latter is more informative about within-MDD heterogeneity.
RESUMO
Biobanks that collect deep phenotypic and genomic data across many individuals have emerged as a key resource in human genetics. However, phenotypes in biobanks are often missing across many individuals, limiting their utility. We propose AutoComplete, a deep learning-based imputation method to impute or 'fill-in' missing phenotypes in population-scale biobank datasets. When applied to collections of phenotypes measured across ~300,000 individuals from the UK Biobank, AutoComplete substantially improved imputation accuracy over existing methods. On three traits with notable amounts of missingness, we show that AutoComplete yields imputed phenotypes that are genetically similar to the originally observed phenotypes while increasing the effective sample size by about twofold on average. Further, genome-wide association analyses on the resulting imputed phenotypes led to a substantial increase in the number of associated loci. Our results demonstrate the utility of deep learning-based phenotype imputation to increase power for genetic discoveries in existing biobank datasets.
Assuntos
Aprendizado Profundo , Estudo de Associação Genômica Ampla , Humanos , Estudo de Associação Genômica Ampla/métodos , Genótipo , Bancos de Espécimes Biológicos , Polimorfismo de Nucleotídeo Único , FenótipoRESUMO
Biobanks often contain several phenotypes relevant to diseases such as major depressive disorder (MDD), with partly distinct genetic architectures. Researchers face complex tradeoffs between shallow (large sample size, low specificity/sensitivity) and deep (small sample size, high specificity/sensitivity) phenotypes, and the optimal choices are often unclear. Here we propose to integrate these phenotypes to combine the benefits of each. We use phenotype imputation to integrate information across hundreds of MDD-relevant phenotypes, which significantly increases genome-wide association study (GWAS) power and polygenic risk score (PRS) prediction accuracy of the deepest available MDD phenotype in UK Biobank, LifetimeMDD. We demonstrate that imputation preserves specificity in its genetic architecture using a novel PRS-based pleiotropy metric. We further find that integration via summary statistics also enhances GWAS power and PRS predictions, but can introduce nonspecific genetic effects depending on input. Our work provides a simple and scalable approach to improve genetic studies in large biobanks by integrating shallow and deep phenotypes.
Assuntos
Transtorno Depressivo Maior , Humanos , Transtorno Depressivo Maior/genética , Predisposição Genética para Doença , Bancos de Espécimes Biológicos , Estudo de Associação Genômica Ampla , Herança Multifatorial/genética , Fenótipo , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Recurrent copy number variants (rCNVs) are associated with increased risk of neuropsychiatric disorders but their pathogenic population-level impact is unknown. We provide population-based estimates of rCNV-associated risk of neuropsychiatric disorders for 34 rCNVs in the iPSYCH2015 case-cohort sample (n=120,247). Most observed significant increases in rCNV-associated risk for ADHD, autism or schizophrenia were moderate (HR:1.42-5.00), and risk estimates were highly correlated across these disorders, the most notable exception being high autism-associated risk with Prader-Willi/Angelman Syndrome duplications (HR=20.8). No rCNV was associated with significant increase in depression risk. Also, rCNV-associated risk was positively correlated with locus size and gene constraint. Comparison with published rCNV studies suggests that prevalence of some rCNVs is higher, and risk of psychiatric disorders lower, than previously estimated. In an era where genetics is increasingly being clinically applied, our results highlight the importance of population-based risk estimates for genetics-based predictions.
RESUMO
Proportional hazards models have been proposed to analyse time-to-event phenotypes in genome-wide association studies (GWAS). However, little is known about the ability of proportional hazards models to identify genetic associations under different generative models and when ascertainment is present. Here we propose the age-dependent liability threshold (ADuLT) model as an alternative to a Cox regression based GWAS, here represented by SPACox. We compare ADuLT, SPACox, and standard case-control GWAS in simulations under two generative models and with varying degrees of ascertainment as well as in the iPSYCH cohort. We find Cox regression GWAS to be underpowered when cases are strongly ascertained (cases are oversampled by a factor 5), regardless of the generative model used. ADuLT is robust to ascertainment in all simulated scenarios. Then, we analyse four psychiatric disorders in iPSYCH, ADHD, Autism, Depression, and Schizophrenia, with a strong case-ascertainment. Across these psychiatric disorders, ADuLT identifies 20 independent genome-wide significant associations, case-control GWAS finds 17, and SPACox finds 8, which is consistent with simulation results. As more genetic data are being linked to electronic health records, robust GWAS methods that can make use of age-of-onset information will help increase power in analyses for common health outcomes.
Assuntos
Transtorno Autístico , Estudo de Associação Genômica Ampla , Humanos , Simulação por Computador , Registros Eletrônicos de Saúde , Fator VRESUMO
The predictive performance of polygenic scores (PGS) is largely dependent on the number of samples available to train the PGS. Increasing the sample size for a specific phenotype is expensive and takes time, but this sample size can be effectively increased by using genetically correlated phenotypes. We propose a framework to generate multi-PGS from thousands of publicly available genome-wide association studies (GWAS) with no need to individually select the most relevant ones. In this study, the multi-PGS framework increases prediction accuracy over single PGS for all included psychiatric disorders and other available outcomes, with prediction R2 increases of up to 9-fold for attention-deficit/hyperactivity disorder compared to a single PGS. We also generate multi-PGS for phenotypes without an existing GWAS and for case-case predictions. We benchmark the multi-PGS framework against other methods and highlight its potential application to new emerging biobanks.
Assuntos
Transtorno do Deficit de Atenção com Hiperatividade , Estudo de Associação Genômica Ampla , Humanos , Transtorno do Deficit de Atenção com Hiperatividade/genética , Fenótipo , Herança Multifatorial/genéticaRESUMO
Circulating levels of the astrocytic marker S100B have been associated with risk of neuropsychiatric or neurological disorders. However, reported effects have been inconsistent, and no causal relations have yet been established. We applied two-sample Mendelian Randomization (MR) on the association statistics from genome-wide association studies (GWAS) for circulating S100B levels measured 5-7 days after birth (the iPSYCH sample) and in an older adult sample (mean age, 72.5 years; the Lothian sample), upon those derived from major depression disorder (MDD), schizophrenia (SCZ), bipolar disorder (BIP), autism spectral disorder (ASD), Alzheimer's disease (AD), and Parkinson's disease (PD). We studied the causal relations in the two S100B datasets for risk of these six neuropsychiatric disorders. MR suggested increased S100B levels 5-7 days after birth to causally increase the risk of MDD (OR = 1.014; 95%CI = 1.007-1.022; FDR-corrected p = 6.43×10-4). In older adults, MR suggested increased S100B levels to have a causal relation to the risk of BIP (OR = 1.075; 95%CI = 1.026-1.127; FDR-corrected p = 1.35×10-2). No significant causal relations were found for the other five disorders. We did not observe any evidence for reverse causality of these neuropsychiatric or neurological disorders on altered S100B levels. Sensitivity analyses using more stringent SNP-selection criteria and three alternative MR models suggested the results are robust. Altogether, our findings imply a small cause-effect relation for the previously reported associations of S100B and mood disorders. Such findings may provide a novel avenue for the diagnosis and management of disorders.
Assuntos
Transtorno Depressivo Maior , Doenças do Sistema Nervoso , Doença de Parkinson , Recém-Nascido , Humanos , Idoso , Estudo de Associação Genômica Ampla , Análise da Randomização Mendeliana , Doenças do Sistema Nervoso/genética , Transtorno Depressivo Maior/genética , Subunidade beta da Proteína Ligante de Cálcio S100/genéticaRESUMO
Symptoms of Major Depressive Disorder (MDD) are commonly assessed using self-rating instruments like the Patient Health Questionnaire 9 (PHQ9, for current symptoms), and the Composite International Diagnostic Interview Short-Form (CIDI-SF, for lifetime worst-episode symptoms). Using data from the UKBiobank, we show that corresponding symptoms endorsed through PHQ9 and CIDI-SF have low to moderate genetic correlations (rG=0.43-0.87), and this cannot be fully attributed to different severity thresholds or the use of a skip-structure in CIDI-SF. Through a combination of Mendelian Randomization (MR) and polygenic prediction analyses, we find that PHQ9 symptoms are more associated with traits which reflect general dysphoria, while the skip-structure in CIDI-SF allows for the identification of heterogeneity among likely MDD cases. This has important implications on factor analyses performed on their respective genetic covariance matrices for the purpose of identification of genetic factors behind MDD symptom dimensions and heterogeneity.
RESUMO
BACKGROUND: Increased prevalence of mental illness has been reported in clinical studies of sex chromosome aneuploidies, but accurate population-based estimates of the prevalence and clinical detection rate of sex chromosome aneuploidies and the associated risks of psychiatric disorders are needed. In this study, we provide such estimates, valid for children and young adults of the contemporary Danish population. METHODS: We used the iPSYCH2015 case-cohort dataset, which is based on a source population of single-born individuals born in Denmark between May 1, 1981, and Dec 31, 2008. The case sample comprises all individuals from the source population with a diagnosis of any index psychiatric disorder (schizophrenia spectrum disorder, bipolar disorder, major depressive disorder, autism spectrum disorder, or ADHD) by the end of follow-up (Dec 31, 2015), registered in the hospital-based Danish Psychiatric Central Research Register. The cohort consists of individuals randomly selected from the source population, and overlaps with the case sample. Biobanked blood samples for individuals in the case and cohort samples underwent genotyping and quality-control filtering, after which we analysed microarray data to detect sex chromosome aneuploidy karyotypes (45,X, 47,XXX, 47,XXY, and 47,XYY). We estimated the population-valid prevalence of these karyotypes from the cohort sample. Weighted Cox proportional hazards models were used to estimate the risks of each index psychiatric disorder associated with each sex chromosome aneuploidy karyotype, by use of date of first hospitalisation with the index disorder in the respective case group and the cohort as outcome. The clinical detection rate was determined by comparing records of clinical diagnoses of genetic conditions from the Danish National Patient Register with sex chromosome aneuploidy karyotype determined by our study. FINDINGS: The assessed sample comprised 119â481 individuals (78â726 in the case sample and 43â326 in the cohort) who had genotyped and quality-control-filtered blood samples, including 64â533 (54%) people of gonadal male sex and 54â948 (46%) of gonadal female sex. Age during follow-up ranged from 0 to 34·7 years (mean 10·9 years [SD 3·5 years]). Information on ethnicity was not available. We identified 387 (0·3%) individuals as carriers of sex chromosome aneuploidies. The overall prevalence of sex chromosome aneuploidies was 1·5 per 1000 individuals. Each sex chromosome aneuploidy karyotype was associated with an increased risk of at least one index psychiatric disorder, with hazard ratios (HRs) of 2·20 (95% CI 1·42-3·39) for 47,XXY; 2·73 (1·25-6·00) for 47,XXX; 3·56 (1·01-12·53) for 45,X; and 4·30 (2·48-7·55) for 47,XYY. All karyotypes were associated with an increased risk of ADHD (HRs ranging from 1·99 [1·24-3·19] to 6·15 [1·63-23·19]), autism spectrum disorder (2·72 [1·72-4·32] to 8·45 [2·49-28·61]), and schizophrenia spectrum disorder (1·80 [1·15-2·80] to 4·60 [1·57-13·51]). Increased risk of major depressive disorder was found for individuals with 47,XXY (1·88 [1·07-3·33]) and 47,XYY (2·65 [1·12-5·90]), and of bipolar disorder for those with 47,XXX (4·32 [1·12-16·62]). The proportion of sex chromosome aneuploidy carriers who had been clinically diagnosed was 93% for 45,X, but lower for 47,XXY (22%), 47,XXX (15%), and 47,XYY (15%). Among carriers, the risk of diagnosis of at least one index psychiatric disorder did not significantly differ between those who had and had not been clinically diagnosed with sex chromosome aneuploidies (p=0·65). INTERPRETATION: Increased risks of psychiatric disorders associated with sex chromosome aneuploidies, combined with low rates of clinical diagnosis of sex chromosome aneuploidies, compromise the adequate provision of necessary health care and counselling to affected individuals and their families, which might be helped by increased application of genetic testing in clinical settings. FUNDING: Lundbeck Foundation and National Institutes of Health.
Assuntos
Transtorno do Espectro Autista , Transtorno Depressivo Maior , Transtornos Mentais , Criança , Adulto Jovem , Humanos , Masculino , Feminino , Recém-Nascido , Lactente , Pré-Escolar , Adolescente , Adulto , Aneuploidia , Estudos de Coortes , Transtorno Depressivo Maior/epidemiologia , Transtorno Depressivo Maior/genética , Transtornos Mentais/genética , Cromossomos Humanos X , Dinamarca/epidemiologiaRESUMO
Sample recruitment for research consortia, biobanks, and personal genomics companies span years, necessitating genotyping in batches, using different technologies. As marker content on genotyping arrays varies, integrating such datasets is non-trivial and its impact on haplotype estimation (phasing) and whole genome imputation, necessary steps for complex trait analysis, remains under-evaluated. Using the iPSYCH dataset, comprising 130,438 individuals, genotyped in two stages, on different arrays, we evaluated phasing and imputation performance across multiple phasing methods and data integration protocols. While phasing accuracy varied by choice of method and data integration protocol, imputation accuracy varied mostly between data integration protocols. We demonstrate an attenuation in imputation accuracy within samples of non-European origin, highlighting challenges to studying complex traits in diverse populations. Finally, imputation errors can bias association tests, reduce predictive utility of polygenic scores. Carefully optimized data integration strategies enhance accuracy and replicability of complex trait analyses in complex biobanks.
Assuntos
Bancos de Espécimes Biológicos , Herança Multifatorial , Humanos , Haplótipos , Genoma , GenótipoRESUMO
Cognitive functions of individuals with psychiatric disorders differ from that of the general population. Such cognitive differences often manifest early in life as differential school performance and have a strong genetic basis. Here we measured genetic predictors of school performance in 30,982 individuals in English, Danish and mathematics via a genome-wide association study (GWAS) and studied their relationship with risk for six major psychiatric disorders. When decomposing the school performance into math and language-specific performances, we observed phenotypically and genetically a strong negative correlation between math performance and risk for most psychiatric disorders. But language performance correlated positively with risk for certain disorders, especially schizophrenia, which we replicate in an independent sample (n = 4547). We also found that the genetic variants relating to increased risk for schizophrenia and better language performance are overrepresented in individuals involved in creative professions (n = 2953) compared to the general population (n = 164,622). The findings together suggest that language ability, creativity and psychopathology might stem from overlapping genetic roots.