RESUMO
We performed a genome-wide association study of total hip replacements, based on variants identified through whole-genome sequencing, which included 4,657 Icelandic patients and 207,514 population controls. We discovered two rare signals that strongly associate with osteoarthritis total hip replacement: a missense variant, c.1141G>C (p.Asp369His), in the COMP gene (allelic frequency = 0.026%, P = 4.0 × 10-12, odds ratio (OR) = 16.7) and a frameshift mutation, rs532464664 (p.Val330Glyfs*106), in the CHADL gene that associates through a recessive mode of inheritance (homozygote frequency = 0.15%, P = 4.5 × 10-18, OR = 7.71). On average, c.1141G>C heterozygotes and individuals homozygous for rs532464664 had their hip replacement operation 13.5 years and 4.9 years earlier than others (P = 0.0020 and P = 0.0026), respectively. We show that the full-length CHADL transcript is expressed in cartilage. Furthermore, the premature stop codon introduced by the CHADL frameshift mutation results in nonsense-mediated decay of the mutant transcripts.
Assuntos
Proteína de Matriz Oligomérica de Cartilagem/genética , Proteínas da Matriz Extracelular/genética , Predisposição Genética para Doença/genética , Genoma Humano/genética , Osteoartrite do Quadril/genética , Análise de Sequência de DNA/métodos , Artroplastia de Quadril , Células Cultivadas , Códon sem Sentido , Mutação da Fase de Leitura , Expressão Gênica , Frequência do Gene , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Islândia , Estimativa de Kaplan-Meier , Mutação de Sentido Incorreto , Osteoartrite do Quadril/cirurgia , Polimorfismo de Nucleotídeo Único , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Fatores de RiscoRESUMO
The consensus approach to genome-wide association studies (GWAS) has been to assign equal prior probability of association to all sequence variants tested. However, some sequence variants, such as loss-of-function and missense variants, are more likely than others to affect protein function and are therefore more likely to be causative. Using data from whole-genome sequencing of 2,636 Icelanders and the association results for 96 quantitative and 123 binary phenotypes, we estimated the enrichment of association signals by sequence annotation. We propose a weighted Bonferroni adjustment that controls for the family-wise error rate (FWER), using as weights the enrichment of sequence annotations among association signals. We show that this weighted adjustment increases the power to detect association over the standard Bonferroni correction. We use the enrichment of associations by sequence annotation we have estimated in Iceland to derive significance thresholds for other populations with different numbers and combinations of sequence variants.
Assuntos
Estudo de Associação Genômica Ampla/métodos , Anotação de Sequência Molecular , Polimorfismo de Nucleotídeo Único/genética , Estudo de Associação Genômica Ampla/estatística & dados numéricos , Humanos , Islândia/epidemiologia , Fenótipo , Análise de Sequência de DNARESUMO
Loss-of-function mutations cause many mendelian diseases. Here we aimed to create a catalog of autosomal genes that are completely knocked out in humans by rare loss-of-function mutations. We sequenced the whole genomes of 2,636 Icelanders and imputed the sequence variants identified in this set into 101,584 additional chip-genotyped and phased Icelanders. We found a total of 6,795 autosomal loss-of-function SNPs and indels in 4,924 genes. Of the genotyped Icelanders, 7.7% are homozygotes or compound heterozygotes for loss-of-function mutations with a minor allele frequency (MAF) below 2% in 1,171 genes (complete knockouts). Genes that are highly expressed in the brain are less often completely knocked out than other genes. Homozygous loss-of-function offspring of two heterozygous parents occurred less frequently than expected (deficit of 136 per 10,000 transmissions for variants with MAF <2%, 95% confidence interval (CI) = 10-261).
Assuntos
Mutação INDEL , Encéfalo/metabolismo , Feminino , Expressão Gênica , Frequência do Gene , Ontologia Genética , Genoma Humano , Heterozigoto , Homozigoto , Humanos , Masculino , Fases de Leitura Aberta , Especificidade de Órgãos , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNARESUMO
We have accumulated considerable data on the genetic makeup of the Icelandic population by sequencing the whole genomes of 2,636 Icelanders to depth of at least 10X and by chip genotyping 101,584 more. The sequencing was done with Illumina technology. The median sequencing depth was 20X and 909 individuals were sequenced to a depth of at least 30X. We found 20 million single nucleotide polymorphisms (SNPs) and 1.5 million insertions/deletions (indels) that passed stringent quality control. Almost all the common SNPs (derived allele frequency (DAF) over 2%) that we identified in Iceland have been observed by either dbSNP (build 137) or the Exome Sequencing Project (ESP) while only 60 and 20% of rare (DAF<0.5%) SNPs and indels in coding regions, the most heavily studied parts of the genome, have been observed in the public databases. Features of our variant data, such as the transition/transversion ratio and the length distribution of indels, are similar to published reports.
Assuntos
Genoma Humano , Análise de Sequência de DNA , Sequência de Bases , Frequência do Gene , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Mutação INDEL , Islândia , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Here we describe the insights gained from sequencing the whole genomes of 2,636 Icelanders to a median depth of 20×. We found 20 million SNPs and 1.5 million insertions-deletions (indels). We describe the density and frequency spectra of sequence variants in relation to their functional annotation, gene position, pathway and conservation score. We demonstrate an excess of homozygosity and rare protein-coding variants in Iceland. We imputed these variants into 104,220 individuals down to a minor allele frequency of 0.1% and found a recessive frameshift mutation in MYL4 that causes early-onset atrial fibrillation, several mutations in ABCB4 that increase risk of liver diseases and an intronic variant in GNAS associating with increased thyroid-stimulating hormone levels when maternally inherited. These data provide a study design that can be used to determine how variation in the sequence of the human genome gives rise to human diversity.