Your browser doesn't support javascript.
loading
Accuracy of haplotype estimation and whole genome imputation affects complex trait analyses in complex biobanks.
Appadurai, Vivek; Bybjerg-Grauholm, Jonas; Krebs, Morten Dybdahl; Rosengren, Anders; Buil, Alfonso; Ingason, Andrés; Mors, Ole; Børglum, Anders D; Hougaard, David M; Nordentoft, Merete; Mortensen, Preben B; Delaneau, Olivier; Werge, Thomas; Schork, Andrew J.
Afiliação
  • Appadurai V; Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark. vivek.appadurai@regionh.dk.
  • Bybjerg-Grauholm J; The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark. vivek.appadurai@regionh.dk.
  • Krebs MD; The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
  • Rosengren A; Danish Center for Neonatal Screening, Statens Serum Institut, Copenhagen, Denmark.
  • Buil A; Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
  • Ingason A; The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
  • Mors O; Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
  • Børglum AD; The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
  • Hougaard DM; Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
  • Nordentoft M; The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
  • Mortensen PB; Institute of Biological Psychiatry, Mental Health Center Sankt Hans, Roskilde, 4000, Denmark.
  • Delaneau O; The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
  • Werge T; The Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Aarhus, Denmark.
  • Schork AJ; Psychosis Research Unit, Aarhus University Hospital - Psychiatry, Aarhus, Denmark.
Commun Biol ; 6(1): 101, 2023 01 26.
Article em En | MEDLINE | ID: mdl-36697501
Sample recruitment for research consortia, biobanks, and personal genomics companies span years, necessitating genotyping in batches, using different technologies. As marker content on genotyping arrays varies, integrating such datasets is non-trivial and its impact on haplotype estimation (phasing) and whole genome imputation, necessary steps for complex trait analysis, remains under-evaluated. Using the iPSYCH dataset, comprising 130,438 individuals, genotyped in two stages, on different arrays, we evaluated phasing and imputation performance across multiple phasing methods and data integration protocols. While phasing accuracy varied by choice of method and data integration protocol, imputation accuracy varied mostly between data integration protocols. We demonstrate an attenuation in imputation accuracy within samples of non-European origin, highlighting challenges to studying complex traits in diverse populations. Finally, imputation errors can bias association tests, reduce predictive utility of polygenic scores. Carefully optimized data integration strategies enhance accuracy and replicability of complex trait analyses in complex biobanks.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Bancos de Espécimes Biológicos / Herança Multifatorial Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Commun Biol Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Dinamarca País de publicação: Reino Unido

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Bancos de Espécimes Biológicos / Herança Multifatorial Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Commun Biol Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Dinamarca País de publicação: Reino Unido