Your browser doesn't support javascript.
loading
Strategies for phasing and imputation in a population isolate.
Herzig, Anthony Francis; Nutile, Teresa; Babron, Marie-Claude; Ciullo, Marina; Bellenguez, Céline; Leutenegger, Anne-Louise.
Afiliação
  • Herzig AF; Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.
  • Nutile T; Inserm, U946, Genetic Variation and Human Diseases, Paris, France.
  • Babron MC; Institute of Genetics and Biophysics A. Buzzati-Traverso-CNR, Naples, Italy.
  • Ciullo M; Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.
  • Bellenguez C; Inserm, U946, Genetic Variation and Human Diseases, Paris, France.
  • Leutenegger AL; Institute of Genetics and Biophysics A. Buzzati-Traverso-CNR, Naples, Italy.
Genet Epidemiol ; 42(2): 201-213, 2018 03.
Article em En | MEDLINE | ID: mdl-29319195
ABSTRACT
In the search for genetic associations with complex traits, population isolates offer the advantage of reduced genetic and environmental heterogeneity. In addition, cost-efficient next-generation association approaches have been proposed in these populations where only a subsample of representative individuals is sequenced and then genotypes are imputed into the rest of the population. Gene mapping in such populations thus requires high-quality genetic imputation and preliminary phasing. To identify an effective study design, we compare by simulation a range of phasing and imputation software and strategies. We simulated 1,115,604 variants on chromosome 10 for 477 members of the large complex pedigree of Campora, a village within the established isolate of Cilento in southern Italy. We assessed the phasing performance of identical by descent based software ALPHAPHASE and SLRP, LD-based software SHAPEIT2, SHAPEIT3, and BEAGLE, and new software EAGLE that combines both methodologies. For imputation we compared IMPUTE2, IMPUTE4, MINIMAC3, BEAGLE, and new software PBWT. Genotyping errors and missing genotypes were simulated to observe their effects on the performance of each software. Highly accurate phased data were achieved by all software with SHAPEIT2, SHAPEIT3, and EAGLE2 providing the most accurate results. MINIMAC3, IMPUTE4, and IMPUTE2 all performed strongly as imputation software and our study highlights the considerable gain in imputation accuracy provided by a genome sequenced reference panel specific to the population isolate.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Projetos de Pesquisa / Haplótipos / Software / Efeito Fundador / Genética Populacional Tipo de estudo: Prognostic_studies Limite: Female / Humans / Male País/Região como assunto: Europa Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Projetos de Pesquisa / Haplótipos / Software / Efeito Fundador / Genética Populacional Tipo de estudo: Prognostic_studies Limite: Female / Humans / Male País/Região como assunto: Europa Idioma: En Ano de publicação: 2018 Tipo de documento: Article