RESUMEN
The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.
Asunto(s)
Envejecimiento/genética , Mutación de Línea Germinal/genética , Edad Materna , Mutagénesis , Padres , Edad Paterna , Adolescente , Adulto , Anciano , Animales , Niño , Cromosomas Humanos Par 8/genética , Evolución Molecular , Femenino , Secuencia Rica en GC , Genoma Humano/genética , Gorilla gorilla/genética , Humanos , Mutación INDEL , Islandia , Desequilibrio de Ligamiento/genética , Masculino , Persona de Mediana Edad , Tasa de Mutación , Pan troglodytes/genética , Polimorfismo de Nucleótido Simple , Pongo/genética , Adulto JovenRESUMEN
Educational attainment is strongly influenced by social and other environmental factors, but genetic factors are estimated to account for at least 20% of the variation across individuals. Here we report the results of a genome-wide association study (GWAS) for educational attainment that extends our earlier discovery sample of 101,069 individuals to 293,723 individuals, and a replication study in an independent sample of 111,349 individuals from the UK Biobank. We identify 74 genome-wide significant loci associated with the number of years of schooling completed. Single-nucleotide polymorphisms associated with educational attainment are disproportionately found in genomic regions regulating gene expression in the fetal brain. Candidate genes are preferentially expressed in neural tissue, especially during the prenatal period, and enriched for biological pathways involved in neural development. Our findings demonstrate that, even for a behavioural phenotype that is mostly environmentally determined, a well-powered GWAS identifies replicable associated genetic variants that suggest biologically relevant pathways. Because educational attainment is measured in large numbers of individuals, it will continue to be useful as a proxy phenotype in efforts to characterize the genetic influences of related phenotypes, including cognition and neuropsychiatric diseases.
Asunto(s)
Encéfalo/metabolismo , Escolaridad , Feto/metabolismo , Regulación de la Expresión Génica/genética , Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple/genética , Enfermedad de Alzheimer/genética , Trastorno Bipolar/genética , Cognición , Biología Computacional , Interacción Gen-Ambiente , Humanos , Anotación de Secuencia Molecular , Esquizofrenia/genética , Reino UnidoRESUMEN
Epidemiological and genetic association studies show that genetics play an important role in the attainment of education. Here, we investigate the effect of this genetic component on the reproductive history of 109,120 Icelanders and the consequent impact on the gene pool over time. We show that an educational attainment polygenic score, POLYEDU, constructed from results of a recent study is associated with delayed reproduction (P < 10-100) and fewer children overall. The effect is stronger for women and remains highly significant after adjusting for educational attainment. Based on 129,808 Icelanders born between 1910 and 1990, we find that the average POLYEDU has been declining at a rate of â¼0.010 standard units per decade, which is substantial on an evolutionary timescale. Most importantly, because POLYEDU only captures a fraction of the overall underlying genetic component the latter could be declining at a rate that is two to three times faster.
Asunto(s)
Escolaridad , Variación Genética , Adolescente , Adulto , Femenino , Fertilidad , Genoma Humano , Genotipo , Humanos , Islandia , Inteligencia , Masculino , Adulto JovenRESUMEN
Clonal hematopoiesis (CH) arises when a substantial proportion of mature blood cells is derived from a single dominant hematopoietic stem cell lineage. Somatic mutations in candidate driver (CD) genes are thought to be responsible for at least some cases of CH. Using whole-genome sequencing of 11 262 Icelanders, we found 1403 cases of CH by using barcodes of mosaic somatic mutations in peripheral blood, whether or not they have a mutation in a CD gene. We find that CH is very common in the elderly, trending toward inevitability. We show that somatic mutations in TET2, DNMT3A, ASXL1, and PPM1D are associated with CH at high significance. However, known CD mutations were evident in only a fraction of CH cases. Nevertheless, the highly prevalent CH we detect associates with increased mortality rates, risk for hematological malignancy, smoking behavior, telomere length, Y-chromosome loss, and other phenotypic characteristics. Modeling suggests some CH cases could arise in the absence of CD mutations as a result of neutral drift acting on a small population of active hematopoietic stem cells. Finally, we find a germline deletion in intron 3 of the telomerase reverse transcriptase (TERT) gene that predisposes to CH (rs34002450; P = 7.4 × 10-12; odds ratio, 1.37).
Asunto(s)
ADN (Citosina-5-)-Metiltransferasas/genética , Proteínas de Unión al ADN/genética , Hematopoyesis , Células Madre Hematopoyéticas/citología , Mutación , Proteína Fosfatasa 2C/genética , Proteínas Proto-Oncogénicas/genética , Proteínas Represoras/genética , Adulto , Factores de Edad , Anciano , Anciano de 80 o más Años , Células Clonales , ADN Metiltransferasa 3A , Dioxigenasas , Femenino , Neoplasias Hematológicas/epidemiología , Neoplasias Hematológicas/genética , Células Madre Hematopoyéticas/metabolismo , Humanos , Masculino , Persona de Mediana Edad , Factores de RiesgoRESUMEN
Low bone mineral density (BMD) is used as a parameter of osteoporosis. Genome-wide association studies of BMD have hitherto focused on BMD as a quantitative trait, yielding common variants of small effects that contribute to the population diversity in BMD. Here we use BMD as a dichotomous trait, searching for variants that may have a direct effect on the risk of pathologically low BMD rather than on the regulation of BMD in the healthy population. Through whole-genome sequencing of Icelandic individuals, we found a rare nonsense mutation within the leucine-rich-repeat-containing G-protein-coupled receptor 4 (LGR4) gene (c.376C>T) that is strongly associated with low BMD, and with osteoporotic fractures. This mutation leads to termination of LGR4 at position 126 and fully disrupts its function. The c.376C>T mutation is also associated with electrolyte imbalance, late onset of menarche and reduced testosterone levels, as well as an increased risk of squamous cell carcinoma of the skin and biliary tract cancer. Interestingly, the phenotype of carriers of the c.376C>T mutation overlaps that of Lgr4 mutant mice.
Asunto(s)
Neoplasias del Sistema Biliar/genética , Densidad Ósea/genética , Carcinoma de Células Escamosas/genética , Codón sin Sentido/genética , Fracturas Osteoporóticas/genética , Receptores Acoplados a Proteínas G/genética , Neoplasias Cutáneas/genética , Desequilibrio Hidroelectrolítico/genética , Animales , Australia , Dinamarca , Regulación hacia Abajo/genética , Femenino , Heterocigoto , Humanos , Islandia , Masculino , Menarquia/genética , Ratones , Ratones Noqueados , Fenotipo , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/deficiencia , Receptores Acoplados a Proteínas G/metabolismo , Testosterona/análisisRESUMEN
Mutation of the DNA molecule is one of the most fundamental processes in biology. In this study, we use 283 parent-offspring trios to estimate the rate of mutation for both single nucleotide variants (SNVs) and short length variants (indels) in humans and examine the mutation process. We found 17812 SNVs, corresponding to a mutation rate of 1.29 × 10-8 per position per generation (PPPG) and 1282 indels corresponding to a rate of 9.29 × 10-10 PPPG. We estimate that around 3% of human de novo SNVs are part of a multi-nucleotide mutation (MNM), with 558 (3.1%) of mutations positioned less than 20kb from another mutation in the same individual (median distance of 525bp). The rate of de novo mutations is greater in late replicating regions (p = 8.29 × 10-19) and nearer recombination events (p = 0.0038) than elsewhere in the genome.
Asunto(s)
Genoma Humano , Mutación INDEL/genética , Tasa de Mutación , Análisis Mutacional de ADN , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Polimorfismo de Nucleótido Simple/genética , Recombinación Genética/genéticaRESUMEN
Transcriptional and splicing anomalies have been observed in intron 8 of the CASP8 gene (encoding procaspase-8) in association with cutaneous basal-cell carcinoma (BCC) and linked to a germline SNP rs700635. Here, we show that the rs700635[C] allele, which is associated with increased risk of BCC and breast cancer, is protective against prostate cancer [odds ratio (OR) = 0.91, P = 1.0 × 10(-6)]. rs700635[C] is also associated with failures to correctly splice out CASP8 intron 8 in breast and prostate tumours and in corresponding normal tissues. Investigation of rs700635[C] carriers revealed that they have a human-specific short interspersed element-variable number of tandem repeat-Alu (SINE-VNTR-Alu), subfamily-E retrotransposon (SVA-E) inserted into CASP8 intron 8. The SVA-E shows evidence of prior activity, because it has transduced some CASP8 sequences during subsequent retrotransposition events. Whole-genome sequence (WGS) data were used to tag the SVA-E with a surrogate SNP rs1035142[T] (r(2) = 0.999), which showed associations with both the splicing anomalies (P = 6.5 × 10(-32)) and with protection against prostate cancer (OR = 0.91, P = 3.8 × 10(-7)).
Asunto(s)
Neoplasias de la Mama/genética , Carcinoma Basocelular/genética , Caspasa 8/genética , Neoplasias de la Próstata/genética , Empalme del ARN , Retroelementos , Neoplasias Cutáneas/genética , Adulto , Anciano , Anciano de 80 o más Años , Alelos , Secuencia de Bases , Neoplasias de la Mama/metabolismo , Neoplasias de la Mama/patología , Carcinoma Basocelular/metabolismo , Carcinoma Basocelular/patología , Caspasa 8/metabolismo , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Intrones , Masculino , Persona de Mediana Edad , Datos de Secuencia Molecular , Oportunidad Relativa , Polimorfismo de Nucleótido Simple , Neoplasias de la Próstata/metabolismo , Neoplasias de la Próstata/patología , Neoplasias de la Próstata/prevención & control , Factores Protectores , Neoplasias Cutáneas/metabolismo , Neoplasias Cutáneas/patologíaRESUMEN
Mutations generate sequence diversity and provide a substrate for selection. The rate of de novo mutations is therefore of major importance to evolution. Here we conduct a study of genome-wide mutation rates by sequencing the entire genomes of 78 Icelandic parent-offspring trios at high coverage. We show that in our samples, with an average father's age of 29.7, the average de novo mutation rate is 1.20 × 10(-8) per nucleotide per generation. Most notably, the diversity in mutation rate of single nucleotide polymorphisms is dominated by the age of the father at conception of the child. The effect is an increase of about two mutations per year. An exponential model estimates paternal mutations doubling every 16.5 years. After accounting for random Poisson variation, father's age is estimated to explain nearly all of the remaining variation in the de novo mutation counts. These observations shed light on the importance of the father's age on the risk of diseases such as schizophrenia and autism.
Asunto(s)
Trastorno Autístico/genética , Predisposición Genética a la Enfermedad , Tasa de Mutación , Edad Paterna , Esquizofrenia/genética , Adulto , Trastorno Autístico/epidemiología , Trastorno Autístico/etiología , Cromosomas Humanos/genética , Femenino , Genoma Humano/genética , Humanos , Islandia/epidemiología , Masculino , Persona de Mediana Edad , Madres , Óvulo/metabolismo , Linaje , Polimorfismo de Nucleótido Simple/genética , Factores de Riesgo , Esquizofrenia/epidemiología , Esquizofrenia/etiología , Selección Genética/genética , Análisis de Secuencia de ADN , Espermatozoides/metabolismo , Adulto JovenRESUMEN
The prevalence of dementia in the Western world in people over the age of 60 has been estimated to be greater than 5%, about two-thirds of which are due to Alzheimer's disease. The age-specific prevalence of Alzheimer's disease nearly doubles every 5 years after age 65, leading to a prevalence of greater than 25% in those over the age of 90 (ref. 3). Here, to search for low-frequency variants in the amyloid-ß precursor protein (APP) gene with a significant effect on the risk of Alzheimer's disease, we studied coding variants in APP in a set of whole-genome sequence data from 1,795 Icelanders. We found a coding mutation (A673T) in the APP gene that protects against Alzheimer's disease and cognitive decline in the elderly without Alzheimer's disease. This substitution is adjacent to the aspartyl protease ß-site in APP, and results in an approximately 40% reduction in the formation of amyloidogenic peptides in vitro. The strong protective effect of the A673T substitution against Alzheimer's disease provides proof of principle for the hypothesis that reducing the ß-cleavage of APP may protect against the disease. Furthermore, as the A673T allele also protects against cognitive decline in the elderly without Alzheimer's disease, the two may be mediated through the same or similar mechanisms.
Asunto(s)
Envejecimiento/genética , Enfermedad de Alzheimer/genética , Precursor de Proteína beta-Amiloide/genética , Precursor de Proteína beta-Amiloide/metabolismo , Trastornos del Conocimiento/genética , Trastornos del Conocimiento/fisiopatología , Mutación/genética , Alelos , Enfermedad de Alzheimer/patología , Enfermedad de Alzheimer/fisiopatología , Enfermedad de Alzheimer/prevención & control , Secretasas de la Proteína Precursora del Amiloide/metabolismo , Precursor de Proteína beta-Amiloide/química , Ácido Aspártico Endopeptidasas/metabolismo , Cognición/fisiología , Trastornos del Conocimiento/prevención & control , Predisposición Genética a la Enfermedad , Células HEK293 , Humanos , Placa Amiloide/genética , Placa Amiloide/metabolismoRESUMEN
AIMS: Atrial fibrillation (AF) is the most common sustained cardiac arrhythmia in man, causing substantial morbidity and mortality with a major worldwide public health impact. It is increasingly recognized as a highly heritable condition. This study aimed to determine genetic risk factors for early-onset AF. METHODS AND RESULTS: We sequenced the whole genomes of 8453 Icelanders and imputed genotypes of the 25.5 million sequence variants we discovered into 1799 Icelanders with early-onset AF (diagnosed before 60 years of age) and 337 453 controls. Each sequence variant was tested for association based on multiplicative and recessive inheritance models. We discovered a rare frameshift deletion in the myosin MYL4 gene (c.234delC) that associates with early-onset AF under a recessive mode of inheritance (allelic frequency = 0.58%). We found eight homozygous carriers of the mutation, all of whom had early-onset AF. Six of the homozygotes were diagnosed by the age of 30 and the remaining two in their 50s. Three of the homozygotes had received pacemaker implantations due to sick sinus syndrome, three had suffered an ischemic stroke, and one suffered sudden cardiac death. CONCLUSIONS: Through a population approach we found a loss of function mutation in the myosin gene MYL4 that, in the homozygous state, is completely penetrant for early-onset AF. The finding may provide novel mechanistic insight into the pathophysiology of this complex arrhythmia.
Asunto(s)
Fibrilación Atrial/genética , Mutación del Sistema de Lectura/genética , Cadenas Ligeras de Miosina/genética , Anciano , Fibrilación Atrial/etnología , Estudios de Casos y Controles , Muerte Súbita Cardíaca/etnología , Muerte Súbita Cardíaca/etiología , Femenino , Eliminación de Gen , Genes Recesivos/genética , Estudio de Asociación del Genoma Completo/métodos , Heterocigoto , Homocigoto , Humanos , Islandia/etnología , Masculino , Persona de Mediana Edad , Linaje , Factores de Riesgo , Sarcómeros , Alineación de Secuencia/métodos , Síndrome del Seno Enfermo/etnología , Síndrome del Seno Enfermo/genética , Accidente Cerebrovascular/etnología , Accidente Cerebrovascular/genéticaRESUMEN
Uncertainty about the phase of strings of SNPs creates complications in genetic analysis, although methods have been developed for phasing population-based samples. However, these methods can only phase a small number of SNPs effectively and become unreliable when applied to SNPs spanning many linkage disequilibrium (LD) blocks. Here we show how to phase more than 1,000 SNPs simultaneously for a large fraction of the 35,528 Icelanders genotyped by Illumina chips. Moreover, haplotypes that are identical by descent (IBD) between close and distant relatives, for example, those separated by ten meioses or more, can often be reliably detected. This method is particularly powerful in studies of the inheritance of recurrent mutations and fine-scale recombinations in large sample sets. A further extension of the method allows us to impute long haplotypes for individuals who are not genotyped.
Asunto(s)
Algoritmos , Haplotipos , Complejo Mayor de Histocompatibilidad , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Secuencia de Bases , Femenino , Eliminación de Gen , Marcadores Genéticos , Genética de Población , Humanos , Islandia , Patrón de Herencia , MasculinoRESUMEN
Chronic kidney disease (CKD) is a complex disorder with a strong genetic component. A number of common sequence variants have been found to associate with serum creatinine (SCr), estimated glomerular filtration rate (eGFR) and/or CKD. We imputed 24 million single-nucleotide polymorphisms and insertions/deletions identified by whole-genome sequencing of 2230 Icelanders into 81 656 chip-typed individuals and 112 630 relatives of genotyped individuals over the age of 18 with SCr measurements. The large set of sequenced individuals allowed accurate imputation of variants to a minor allele frequency (MAF) of 0.1%. We tested the imputed variants for association with SCr. In addition to replicating established loci, we discovered missense and loss-of-function variants associating with SCr in three solute carriers (SLC6A19, SLC25A45 and SLC47A1) and two E3 ubiquitin ligases (RNF186 and RNF128). All the variants are within coding sequences and all but one are rare (MAF <2%) with SCr effects between 0.085 and 0.129 standard deviations. These rare variants have a larger effect on SCr than previously reported common variants, explaining 0.5% of the variability of SCr in Icelanders in addition to the 1% already accounted for. We tested the five variants associating with SCr for association with CKD in an Icelandic sample of 15 594 cases and 291 428 controls. Three of the variants also associated with CKD. These variants may either affect kidney function or creatinine synthesis and excretion. Of note were four mutations in SLC6A19 that associate with reduced SCr, three of which have been shown to cause Hartnup disease.
Asunto(s)
Sistemas de Transporte de Aminoácidos Neutros/genética , Creatinina/sangre , Proteínas de la Membrana/genética , Proteínas Mitocondriales/genética , Proteínas de Transporte de Catión Orgánico/genética , Insuficiencia Renal Crónica/genética , Ubiquitina-Proteína Ligasas/genética , Adolescente , Adulto , Anciano , Alelos , Estudios de Casos y Controles , Femenino , Frecuencia de los Genes , Sitios Genéticos , Estudio de Asociación del Genoma Completo , Genotipo , Tasa de Filtración Glomerular , Humanos , Mutación INDEL , Islandia , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Insuficiencia Renal Crónica/sangre , Insuficiencia Renal Crónica/patologíaRESUMEN
To search for new sequence variants that confer risk of cutaneous basal cell carcinoma (BCC), we conducted a genome-wide association study of 38.5 million single nucleotide polymorphisms (SNPs) and small indels identified through whole-genome sequencing of 2230 Icelanders. We imputed genotypes for 4208 BCC patients and 109 408 controls using Illumina SNP chip typing data, carried out association tests and replicated the findings in independent population samples. We found new BCC susceptibility loci at TGM3 (rs214782[G], P = 5.5 × 10(-17), OR = 1.29) and RGS22 (rs7006527[C], P = 8.7 × 10(-13), OR = 0.77). TGM3 encodes transglutaminase type 3, which plays a key role in production of the cornified envelope during epidermal differentiation.
Asunto(s)
Antígenos de Superficie/genética , Carcinoma Basocelular/genética , Reguladores de Proteínas de Unión al GTP/genética , Variación Genética , Mutación de Línea Germinal , Transglutaminasas/genética , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Niño , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Genotipo , Células Germinativas/metabolismo , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Neoplasias Cutáneas/genética , Adulto JovenRESUMEN
BACKGROUND: Sequence variants, including the ε4 allele of apolipoprotein E, have been associated with the risk of the common late-onset form of Alzheimer's disease. Few rare variants affecting the risk of late-onset Alzheimer's disease have been found. METHODS: We obtained the genome sequences of 2261 Icelanders and identified sequence variants that were likely to affect protein function. We imputed these variants into the genomes of patients with Alzheimer's disease and control participants and then tested for an association with Alzheimer's disease. We performed replication tests using case-control series from the United States, Norway, The Netherlands, and Germany. We also tested for a genetic association with cognitive function in a population of unaffected elderly persons. RESULTS: A rare missense mutation (rs75932628-T) in the gene encoding the triggering receptor expressed on myeloid cells 2 (TREM2), which was predicted to result in an R47H substitution, was found to confer a significant risk of Alzheimer's disease in Iceland (odds ratio, 2.92; 95% confidence interval [CI], 2.09 to 4.09; P=3.42×10(-10)). The mutation had a frequency of 0.46% in controls 85 years of age or older. We observed the association in additional sample sets (odds ratio, 2.90; 95% CI, 2.16 to 3.91; P=2.1×10(-12) in combined discovery and replication samples). We also found that carriers of rs75932628-T between the ages of 80 and 100 years without Alzheimer's disease had poorer cognitive function than noncarriers (P=0.003). CONCLUSIONS: Our findings strongly implicate variant TREM2 in the pathogenesis of Alzheimer's disease. Given the reported antiinflammatory role of TREM2 in the brain, the R47H substitution may lead to an increased predisposition to Alzheimer's disease through impaired containment of inflammatory processes. (Funded by the National Institute on Aging and others.).
Asunto(s)
Enfermedad de Alzheimer/genética , Glicoproteínas de Membrana/genética , Mutación Missense , Receptores Inmunológicos/genética , Anciano de 80 o más Años , Apolipoproteína E4/genética , Estudios de Casos y Controles , Cognición , Variación Genética , Técnicas de Genotipaje , Heterocigoto , Humanos , Islandia , Polimorfismo de Nucleótido Simple , Factores de Riesgo , Análisis de Secuencia de ADNRESUMEN
Although recent genome-wide studies have provided valuable insights into the genetic basis of human disease, they have explained relatively little of the heritability of most complex traits, and the variants identified through these studies have small effect sizes. This has led to the important and hotly debated issue of where the 'missing heritability' of complex diseases might be found. Here, seven leading geneticists offer their opinion about where this heritability is likely to lie, what this could tell us about the underlying genetic architecture of common diseases and how this could inform research strategies for uncovering genetic risk factors.
Asunto(s)
Enfermedad/etiología , Estudio de Asociación del Genoma Completo/métodos , Estudio de Asociación del Genoma Completo/normas , Patrón de Herencia/genética , Proyectos de Investigación , Enfermedad/genética , Predisposición Genética a la Enfermedad/epidemiología , Predisposición Genética a la Enfermedad/genética , Genotipo , Humanos , Fenotipo , Polimorfismo Genético/fisiología , Carácter Cuantitativo Heredable , Proyectos de Investigación/normasRESUMEN
Meiotic recombinations contribute to genetic diversity by yielding new combinations of alleles. Recently, high-resolution recombination maps were inferred from high-density single-nucleotide polymorphism (SNP) data using linkage disequilibrium (LD) patterns that capture historical recombination events. The use of these maps has been demonstrated by the identification of recombination hotspots and associated motifs, and the discovery that the PRDM9 gene affects the proportion of recombinations occurring at hotspots. However, these maps provide no information about individual or sex differences. Moreover, locus-specific demographic factors like natural selection can bias LD-based estimates of recombination rate. Existing genetic maps based on family data avoid these shortcomings, but their resolution is limited by relatively few meioses and a low density of markers. Here we used genome-wide SNP data from 15,257 parent-offspring pairs to construct the first recombination maps based on directly observed recombinations with a resolution that is effective down to 10 kilobases (kb). Comparing male and female maps reveals that about 15% of hotspots in one sex are specific to that sex. Although male recombinations result in more shuffling of exons within genes, female recombinations generate more new combinations of nearby genes. We discover novel associations between recombination characteristics of individuals and variants in the PRDM9 gene and we identify new recombination hotspots. Comparisons of our maps with two LD-based maps inferred from data of HapMap populations of Utah residents with ancestry from northern and western Europe (CEU) and Yoruba in Ibadan, Nigeria (YRI) reveal population differences previously masked by noise and map differences at regions previously described as targets of natural selection.
Asunto(s)
Cromosomas Humanos/genética , Recombinación Genética/genética , Caracteres Sexuales , Alelos , Proteínas de Unión al ADN/genética , Europa (Continente)/etnología , Exones/genética , Femenino , Genética de Población , Haplotipos/genética , Heterocigoto , N-Metiltransferasa de Histona-Lisina/genética , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Meiosis/genética , Nigeria/etnología , Linaje , Polimorfismo de Nucleótido Simple/genética , Tamaño de la Muestra , Selección Genética/genética , UtahRESUMEN
Genome-wide association studies have mainly relied on common HapMap sequence variations. Recently, sequencing approaches have allowed analysis of low frequency and rare variants in conjunction with common variants, thereby improving the search for functional variants and thus the understanding of the underlying biology of human traits and diseases. Here, we used a large Icelandic whole genome sequence dataset combined with Danish exome sequence data to gain insight into the genetic architecture of serum levels of vitamin B(12) (B12) and folate. Up to 22.9 million sequence variants were analyzed in combined samples of 45,576 and 37,341 individuals with serum B(12) and folate measurements, respectively. We found six novel loci associating with serum B(12) (CD320, TCN2, ABCD4, MMAA, MMACHC) or folate levels (FOLR3) and confirmed seven loci for these traits (TCN1, FUT6, FUT2, CUBN, CLYBL, MUT, MTHFR). Conditional analyses established that four loci contain additional independent signals. Interestingly, 13 of the 18 identified variants were coding and 11 of the 13 target genes have known functions related to B(12) and folate pathways. Contrary to epidemiological studies we did not find consistent association of the variants with cardiovascular diseases, cancers or Alzheimer's disease although some variants demonstrated pleiotropic effects. Although to some degree impeded by low statistical power for some of these conditions, these data suggest that sequence variants that contribute to the population diversity in serum B(12) or folate levels do not modify the risk of developing these conditions. Yet, the study demonstrates the value of combining whole genome and exome sequencing approaches to ascertain the genetic and molecular architectures underlying quantitative trait associations.
Asunto(s)
Deficiencia de Ácido Fólico/genética , Ácido Fólico/sangre , Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Vitamina B 12/sangre , Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/metabolismo , Dinamarca , Exoma , Ácido Fólico/metabolismo , Deficiencia de Ácido Fólico/metabolismo , Genoma Humano , Humanos , Islandia , Metilenotetrahidrofolato Reductasa (NADPH2)/genética , Vitamina B 12/metabolismoRESUMEN
We recently described an association between risk of type 2diabetes and variants in the transcription factor 7-like 2 gene (TCF7L2; formerly TCF4), with a population attributable risk (PAR) of 17%-28% in three populations of European ancestry. Here, we refine the definition of the TCF7L2 type 2diabetes risk variant, HapB(T2D), to the ancestral T allele of a SNP, rs7903146, through replication in West African and Danish type 2 diabetes case-control studies and an expanded Icelandic study. We also identify another variant of the same gene, HapA, that shows evidence of positive selection in East Asian, European and West African populations. Notably, HapA shows a suggestive association with body mass index and altered concentrations of the hunger-satiety hormones ghrelin and leptin in males, indicating that the selective advantage of HapA may have been mediated through effects on energy metabolism.
Asunto(s)
Evolución Biológica , Diabetes Mellitus Tipo 2/genética , Polimorfismo de Nucleótido Simple , Factores de Transcripción TCF/genética , Pueblo Asiatico , Población Negra , Índice de Masa Corporal , Estudios de Casos y Controles , Femenino , Predisposición Genética a la Enfermedad , Variación Genética , Haplotipos , Humanos , Islandia , Masculino , Riesgo , Selección Genética , Proteína 2 Similar al Factor de Transcripción 7 , Población BlancaRESUMEN
We conducted a genome-wide association study for type 2 diabetes (T2D) in Icelandic cases and controls, and we found that a previously described variant in the transcription factor 7-like 2 gene (TCF7L2) gene conferred the most significant risk. In addition to confirming two recently identified risk variants, we identified a variant in the CDKAL1 gene that was associated with T2D in individuals of European ancestry (allele-specific odds ratio (OR) = 1.20 (95% confidence interval, 1.13-1.27), P = 7.7 x 10(-9)) and individuals from Hong Kong of Han Chinese ancestry (OR = 1.25 (1.11-1.40), P = 0.00018). The genotype OR of this variant suggested that the effect was substantially stronger in homozygous carriers than in heterozygous carriers. The ORs for homozygotes were 1.50 (1.31-1.72) and 1.55 (1.23-1.95) in the European and Hong Kong groups, respectively. The insulin response for homozygotes was approximately 20% lower than for heterozygotes or noncarriers, suggesting that this variant confers risk of T2D through reduced insulin secretion.
Asunto(s)
Proteínas Portadoras/genética , Diabetes Mellitus Tipo 2/genética , Resistencia a la Insulina/genética , Péptidos y Proteínas de Señalización Intracelular/genética , Proteínas del Tejido Nervioso/genética , Polimorfismo de Nucleótido Simple , Adulto , Glucemia/metabolismo , Estudios de Casos y Controles , Estudios Transversales , Femenino , Frecuencia de los Genes , Genoma Humano , Humanos , Insulina/metabolismo , Secreción de Insulina , Desequilibrio de Ligamiento , Masculino , Persona de Mediana Edad , Factores de Transcripción TCF/genética , Proteína 1 Similar al Factor de Transcripción 7 , Proteína 2 Similar al Factor de Transcripción 7RESUMEN
Prostate cancer is the most prevalent noncutaneous cancer in males in developed regions, with African American men having among the highest worldwide incidence and mortality rates. Here we report a second genetic variant in the 8q24 region that, in conjunction with another variant we recently discovered, accounts for about 11%-13% of prostate cancer cases in individuals of European descent and 31% of cases in African Americans. We made the current discovery through a genome-wide association scan of 1,453 affected Icelandic individuals and 3,064 controls using the Illumina HumanHap300 BeadChip followed by four replication studies. A key step in the discovery was the construction of a 14-SNP haplotype that efficiently tags a relatively uncommon (2%-4%) susceptibility variant in individuals of European descent that happens to be very common (approximately 42%) in African Americans. The newly identified variant shows a stronger association with affected individuals who have an earlier age at diagnosis.