RESUMEN
We examined the rate and nature of mitochondrial DNA (mtDNA) mutations in humans using sequence data from 64,806 contemporary Icelanders from 2,548 matrilines. Based on 116,663 mother-child transmissions, 8,199 mutations were detected, providing robust rate estimates by nucleotide type, functional impact, position, and different alleles at the same position. We thoroughly document the true extent of hypermutability in mtDNA, mainly affecting the control region but also some coding-region variants. The results reveal the impact of negative selection on viable deleterious mutations, including rapidly mutating disease-associated 3243A>G and 1555A>G and pre-natal selection that most likely occurs during the development of oocytes. Finally, we show that the fate of new mutations is determined by a drastic germline bottleneck, amounting to an average of 3 mtDNA units effectively transmitted from mother to child.
Asunto(s)
ADN Mitocondrial , Linaje , Humanos , ADN Mitocondrial/genética , Femenino , Islandia , Masculino , Mutación , Tasa de MutaciónRESUMEN
Human genetic studies of common variants have provided substantial insight into the biological mechanisms that govern ovarian ageing1. Here we report analyses of rare protein-coding variants in 106,973 women from the UK Biobank study, implicating genes with effects around five times larger than previously found for common variants (ETAA1, ZNF518A, PNPLA8, PALB2 and SAMHD1). The SAMHD1 association reinforces the link between ovarian ageing and cancer susceptibility1, with damaging germline variants being associated with extended reproductive lifespan and increased all-cause cancer risk in both men and women. Protein-truncating variants in ZNF518A are associated with shorter reproductive lifespan-that is, earlier age at menopause (by 5.61 years) and later age at menarche (by 0.56 years). Finally, using 8,089 sequenced trios from the 100,000 Genomes Project (100kGP), we observe that common genetic variants associated with earlier ovarian ageing associate with an increased rate of maternally derived de novo mutations. Although we were unable to replicate the finding in independent samples from the deCODE study, it is consistent with the expected role of DNA damage response genes in maintaining the genetic integrity of germ cells. This study provides evidence of genetic links between age of menopause and cancer risk.
Asunto(s)
Envejecimiento , Predisposición Genética a la Enfermedad , Menopausia , Tasa de Mutación , Neoplasias , Ovario , Adulto , Femenino , Humanos , Masculino , Persona de Mediana Edad , Envejecimiento/genética , Envejecimiento/patología , Daño del ADN/genética , Fertilidad/genética , Predisposición Genética a la Enfermedad/genética , Variación Genética/genética , Genoma Humano/genética , Mutación de Línea Germinal/genética , Menarquia/genética , Menopausia/genética , Neoplasias/genética , Ovario/metabolismo , Ovario/patología , Factores de Tiempo , Biobanco del Reino Unido , Reino Unido/epidemiologíaRESUMEN
BACKGROUND: In 2021, the American College of Medical Genetics and Genomics (ACMG) recommended reporting actionable genotypes in 73 genes associated with diseases for which preventive or therapeutic measures are available. Evaluations of the association of actionable genotypes in these genes with life span are currently lacking. METHODS: We assessed the prevalence of coding and splice variants in genes on the ACMG Secondary Findings, version 3.0 (ACMG SF v3.0), list in the genomes of 57,933 Icelanders. We assigned pathogenicity to all reviewed variants using reported evidence in the ClinVar database, the frequency of variants, and their associations with disease to create a manually curated set of actionable genotypes (variants). We assessed the relationship between these genotypes and life span and further examined the specific causes of death among carriers. RESULTS: Through manual curation of 4405 sequence variants in the ACMG SF v3.0 genes, we identified 235 actionable genotypes in 53 genes. Of the 57,933 participants, 2306 (4.0%) carried at least one actionable genotype. We found shorter median survival among persons carrying actionable genotypes than among noncarriers. Specifically, we found that carrying an actionable genotype in a cancer gene was associated with survival that was 3 years shorter than that among noncarriers, with causes of death among carriers attributed primarily to cancer-related conditions. Furthermore, we found evidence of association between carrying an actionable genotype in certain genes in the cardiovascular disease group and a reduced life span. CONCLUSIONS: On the basis of the ACMG SF v3.0 guidelines, we found that approximately 1 in 25 Icelanders carried an actionable genotype and that carrying such a genotype was associated with a reduced life span. (Funded by deCODE Genetics-Amgen.).
Asunto(s)
Enfermedad , Genómica , Longevidad , Humanos , Alelos , Pruebas Genéticas , Variación Genética , Genotipo , Islandia/epidemiología , Longevidad/genética , Enfermedad/genética , Enfermedades Cardiovasculares/genética , Neoplasias/genéticaRESUMEN
Human evolutionary history is rich with the interbreeding of divergent populations. Most humans outside of Africa trace about 2% of their genomes to admixture from Neanderthals, which occurred 50-60 thousand years ago1. Here we examine the effect of this event using 14.4 million putative archaic chromosome fragments that were detected in fully phased whole-genome sequences from 27,566 Icelanders, corresponding to a range of 56,388-112,709 unique archaic fragments that cover 38.0-48.2% of the callable genome. On the basis of the similarity with known archaic genomes, we assign 84.5% of fragments to an Altai or Vindija Neanderthal origin and 3.3% to Denisovan origin; 12.2% of fragments are of unknown origin. We find that Icelanders have more Denisovan-like fragments than expected through incomplete lineage sorting. This is best explained by Denisovan gene flow, either into ancestors of the introgressing Neanderthals or directly into humans. A within-individual, paired comparison of archaic fragments with syntenic non-archaic fragments revealed that, although the overall rate of mutation was similar in humans and Neanderthals during the 500 thousand years that their lineages were separate, there were differences in the relative frequencies of mutation types-perhaps due to different generation intervals for males and females. Finally, we assessed 271 phenotypes, report 5 associations driven by variants in archaic fragments and show that the majority of previously reported associations are better explained by non-archaic variants.
Asunto(s)
Introgresión Genética/genética , Genoma Humano/genética , Genómica , Mutación , Hombre de Neandertal/genética , Animales , Femenino , Estudios de Asociación Genética , Haploidia , Humanos , Islandia , Masculino , Fenotipo , FilogeniaRESUMEN
BACKGROUND: During the current worldwide pandemic, coronavirus disease 2019 (Covid-19) was first diagnosed in Iceland at the end of February. However, data are limited on how SARS-CoV-2, the virus that causes Covid-19, enters and spreads in a population. METHODS: We targeted testing to persons living in Iceland who were at high risk for infection (mainly those who were symptomatic, had recently traveled to high-risk countries, or had contact with infected persons). We also carried out population screening using two strategies: issuing an open invitation to 10,797 persons and sending random invitations to 2283 persons. We sequenced SARS-CoV-2 from 643 samples. RESULTS: As of April 4, a total of 1221 of 9199 persons (13.3%) who were recruited for targeted testing had positive results for infection with SARS-CoV-2. Of those tested in the general population, 87 (0.8%) in the open-invitation screening and 13 (0.6%) in the random-population screening tested positive for the virus. In total, 6% of the population was screened. Most persons in the targeted-testing group who received positive tests early in the study had recently traveled internationally, in contrast to those who tested positive later in the study. Children under 10 years of age were less likely to receive a positive result than were persons 10 years of age or older, with percentages of 6.7% and 13.7%, respectively, for targeted testing; in the population screening, no child under 10 years of age had a positive result, as compared with 0.8% of those 10 years of age or older. Fewer females than males received positive results both in targeted testing (11.0% vs. 16.7%) and in population screening (0.6% vs. 0.9%). The haplotypes of the sequenced SARS-CoV-2 viruses were diverse and changed over time. The percentage of infected participants that was determined through population screening remained stable for the 20-day duration of screening. CONCLUSIONS: In a population-based study in Iceland, children under 10 years of age and females had a lower incidence of SARS-CoV-2 infection than adolescents or adults and males. The proportion of infected persons identified through population screening did not change substantially during the screening period, which was consistent with a beneficial effect of containment efforts. (Funded by deCODE Genetics-Amgen.).
Asunto(s)
Infecciones por Coronavirus/epidemiología , Monitoreo Epidemiológico , Neumonía Viral/epidemiología , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Betacoronavirus/genética , COVID-19 , Niño , Preescolar , Trazado de Contacto , Femenino , Haplotipos , Humanos , Islandia/epidemiología , Lactante , Masculino , Tamizaje Masivo , Persona de Mediana Edad , Pandemias , SARS-CoV-2 , Viaje , Adulto JovenRESUMEN
BACKGROUND: Little is known about the nature and durability of the humoral immune response to infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). METHODS: We measured antibodies in serum samples from 30,576 persons in Iceland, using six assays (including two pan-immunoglobulin [pan-Ig] assays), and we determined that the appropriate measure of seropositivity was a positive result with both pan-Ig assays. We tested 2102 samples collected from 1237 persons up to 4 months after diagnosis by a quantitative polymerase-chain-reaction (qPCR) assay. We measured antibodies in 4222 quarantined persons who had been exposed to SARS-CoV-2 and in 23,452 persons not known to have been exposed. RESULTS: Of the 1797 persons who had recovered from SARS-CoV-2 infection, 1107 of the 1215 who were tested (91.1%) were seropositive; antiviral antibody titers assayed by two pan-Ig assays increased during 2 months after diagnosis by qPCR and remained on a plateau for the remainder of the study. Of quarantined persons, 2.3% were seropositive; of those with unknown exposure, 0.3% were positive. We estimate that 0.9% of Icelanders were infected with SARS-CoV-2 and that the infection was fatal in 0.3%. We also estimate that 56% of all SARS-CoV-2 infections in Iceland had been diagnosed with qPCR, 14% had occurred in quarantined persons who had not been tested with qPCR (or who had not received a positive result, if tested), and 30% had occurred in persons outside quarantine and not tested with qPCR. CONCLUSIONS: Our results indicate that antiviral antibodies against SARS-CoV-2 did not decline within 4 months after diagnosis. We estimate that the risk of death from infection was 0.3% and that 44% of persons infected with SARS-CoV-2 in Iceland were not diagnosed by qPCR.
Asunto(s)
Infecciones por Coronavirus/inmunología , Inmunidad Humoral , Neumonía Viral/inmunología , Estudios Seroepidemiológicos , Adulto , Anciano , Anticuerpos Antivirales/sangre , Betacoronavirus , COVID-19 , Infecciones por Coronavirus/mortalidad , Femenino , Humanos , Islandia/epidemiología , Masculino , Persona de Mediana Edad , Pandemias , Neumonía Viral/mortalidad , Reacción en Cadena de la Polimerasa , Cuarentena , SARS-CoV-2RESUMEN
The characterization of mutational processes that generate sequence diversity in the human genome is of paramount importance both to medical genetics and to evolutionary studies. To understand how the age and sex of transmitting parents affect de novo mutations, here we sequence 1,548 Icelanders, their parents, and, for a subset of 225, at least one child, to 35× genome-wide coverage. We find 108,778 de novo mutations, both single nucleotide polymorphisms and indels, and determine the parent of origin of 42,961. The number of de novo mutations from mothers increases by 0.37 per year of age (95% CI 0.32-0.43), a quarter of the 1.51 per year from fathers (95% CI 1.45-1.57). The number of clustered mutations increases faster with the mother's age than with the father's, and the genomic span of maternal de novo mutation clusters is greater than that of paternal ones. The types of de novo mutation from mothers change substantially with age, with a 0.26% (95% CI 0.19-0.33%) decrease in cytosine-phosphate-guanine to thymine-phosphate-guanine (CpG>TpG) de novo mutations and a 0.33% (95% CI 0.28-0.38%) increase in C>G de novo mutations per year, respectively. Remarkably, these age-related changes are not distributed uniformly across the genome. A striking example is a 20 megabase region on chromosome 8p, with a maternal C>G mutation rate that is up to 50-fold greater than the rest of the genome. The age-related accumulation of maternal non-crossover gene conversions also mostly occurs within these regions. Increased sequence diversity and linkage disequilibrium of C>G variants within regions affected by excess maternal mutations indicate that the underlying mutational process has persisted in humans for thousands of years. Moreover, the regional excess of C>G variation in humans is largely shared by chimpanzees, less by gorillas, and is almost absent from orangutans. This demonstrates that sequence diversity in humans results from evolving interactions between age, sex, mutation type, and genomic location.
Asunto(s)
Envejecimiento/genética , Mutación de Línea Germinal/genética , Edad Materna , Mutagénesis , Padres , Edad Paterna , Adolescente , Adulto , Anciano , Animales , Niño , Cromosomas Humanos Par 8/genética , Evolución Molecular , Femenino , Secuencia Rica en GC , Genoma Humano/genética , Gorilla gorilla/genética , Humanos , Mutación INDEL , Islandia , Desequilibrio de Ligamiento/genética , Masculino , Persona de Mediana Edad , Tasa de Mutación , Pan troglodytes/genética , Polimorfismo de Nucleótido Simple , Pongo/genética , Adulto JovenRESUMEN
Objective: Familial hypercholesterolemia (FH) is traditionally defined as a monogenic disease characterized by severely elevated LDL-C (low-density lipoprotein cholesterol) levels. In practice, FH is commonly a clinical diagnosis without confirmation of a causative mutation. In this study, we sought to characterize and compare monogenic and clinically defined FH in a large sample of Icelanders. Approach and Results: We whole-genome sequenced 49 962 Icelanders and imputed the identified variants into an overall sample of 166 281 chip-genotyped Icelanders. We identified 20 FH mutations in LDLR, APOB, and PCSK9 with combined prevalence of 1 in 836. Monogenic FH was associated with severely elevated LDL-C levels and increased risk of premature coronary disease, aortic valve stenosis, and high burden of coronary atherosclerosis. We used a modified version of the Dutch Lipid Clinic Network criteria to screen for the clinical FH phenotype among living adult participants (N=79 058). Clinical FH was found in 2.2% of participants, of whom only 5.2% had monogenic FH. Mutation-negative clinical FH has a strong polygenic basis. Both individuals with monogenic FH and individuals with mutation-negative clinical FH were markedly undertreated with cholesterol-lowering medications and only a minority attained an LDL-C target of <2.6 mmol/L (<100 mg/dL; 11.0% and 24.9%, respectively) or <1.8 mmol/L (<70 mg/dL; 0.0% and 5.2%, respectively), as recommended for primary prevention by European Society of Cardiology/European Atherosclerosis Society cholesterol guidelines. Conclusions: Clinically defined FH is a relatively common phenotype that is explained by monogenic FH in only a minority of cases. Both monogenic and clinical FH confer high cardiovascular risk but are markedly undertreated.
Asunto(s)
Apolipoproteína B-100/genética , Enfermedades Cardiovasculares/genética , Hiperlipoproteinemia Tipo II/genética , Lípidos/sangre , Mutación , Proproteína Convertasa 9/genética , Receptores de LDL/genética , Adulto , Anciano , Anciano de 80 o más Años , Biomarcadores/sangre , Enfermedades Cardiovasculares/diagnóstico , Enfermedades Cardiovasculares/etnología , Enfermedades Cardiovasculares/terapia , Femenino , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Humanos , Inhibidores de Hidroximetilglutaril-CoA Reductasas/uso terapéutico , Hiperlipoproteinemia Tipo II/diagnóstico , Hiperlipoproteinemia Tipo II/tratamiento farmacológico , Hiperlipoproteinemia Tipo II/etnología , Islandia/epidemiología , Masculino , Persona de Mediana Edad , Fenotipo , Prevalencia , Pronóstico , Medición de Riesgo , Factores de Riesgo , Adulto JovenRESUMEN
Kennewick Man, referred to as the Ancient One by Native Americans, is a male human skeleton discovered in Washington state (USA) in 1996 and initially radiocarbon dated to 8,340-9,200 calibrated years before present (BP). His population affinities have been the subject of scientific debate and legal controversy. Based on an initial study of cranial morphology it was asserted that Kennewick Man was neither Native American nor closely related to the claimant Plateau tribes of the Pacific Northwest, who claimed ancestral relationship and requested repatriation under the Native American Graves Protection and Repatriation Act (NAGPRA). The morphological analysis was important to judicial decisions that Kennewick Man was not Native American and that therefore NAGPRA did not apply. Instead of repatriation, additional studies of the remains were permitted. Subsequent craniometric analysis affirmed Kennewick Man to be more closely related to circumpacific groups such as the Ainu and Polynesians than he is to modern Native Americans. In order to resolve Kennewick Man's ancestry and affiliations, we have sequenced his genome to â¼1× coverage and compared it to worldwide genomic data including for the Ainu and Polynesians. We find that Kennewick Man is closer to modern Native Americans than to any other population worldwide. Among the Native American groups for whom genome-wide data are available for comparison, several seem to be descended from a population closely related to that of Kennewick Man, including the Confederated Tribes of the Colville Reservation (Colville), one of the five tribes claiming Kennewick Man. We revisit the cranial analyses and find that, as opposed to genome-wide comparisons, it is not possible on that basis to affiliate Kennewick Man to specific contemporary groups. We therefore conclude based on genetic comparisons that Kennewick Man shows continuity with Native North Americans over at least the last eight millennia.
Asunto(s)
Indígenas Norteamericanos/genética , Filogenia , Esqueleto , Américas , Genoma Humano/genética , Genómica , Humanos , Masculino , Cráneo/anatomía & histología , WashingtónRESUMEN
Yakutia, Sakha Republic, in the Siberian Far East, represents one of the coldest places on Earth, with winter record temperatures dropping below -70 °C. Nevertheless, Yakutian horses survive all year round in the open air due to striking phenotypic adaptations, including compact body conformations, extremely hairy winter coats, and acute seasonal differences in metabolic activities. The evolutionary origins of Yakutian horses and the genetic basis of their adaptations remain, however, contentious. Here, we present the complete genomes of nine present-day Yakutian horses and two ancient specimens dating from the early 19th century and â¼5,200 y ago. By comparing these genomes with the genomes of two Late Pleistocene, 27 domesticated, and three wild Przewalski's horses, we find that contemporary Yakutian horses do not descend from the native horses that populated the region until the mid-Holocene, but were most likely introduced following the migration of the Yakut people a few centuries ago. Thus, they represent one of the fastest cases of adaptation to the extreme temperatures of the Arctic. We find cis-regulatory mutations to have contributed more than nonsynonymous changes to their adaptation, likely due to the comparatively limited standing variation within gene bodies at the time the population was founded. Genes involved in hair development, body size, and metabolic and hormone signaling pathways represent an essential part of the Yakutian horse adaptive genetic toolkit. Finally, we find evidence for convergent evolution with native human populations and woolly mammoths, suggesting that only a few evolutionary strategies are compatible with survival in extremely cold environments.
Asunto(s)
Adaptación Fisiológica/genética , Frío , Caballos/fisiología , Animales , Regiones Árticas , Evolución Molecular , Genoma , Caballos/genética , SiberiaRESUMEN
MOTIVATION: Large-scale genotype datasets can help track the dispersal patterns of epidemiological outbreaks and predict the geographic origins of individuals. Such genetically-based geographic assignments also show a range of possible applications in forensics for profiling both victims and criminals, and in wildlife management, where poaching hotspot areas can be located. They, however, require fast and accurate statistical methods to handle the growing amount of genetic information made available from genotype arrays and next-generation sequencing technologies. RESULTS: We introduce a novel statistical method for geopositioning individuals of unknown origin from genotypes. Our method is based on a geostatistical model trained with a dataset of georeferenced genotypes. Statistical inference under this model can be implemented within the theoretical framework of Integrated Nested Laplace Approximation, which represents one of the major recent breakthroughs in statistics, as it does not require Monte Carlo simulations. We compare the performance of our method and an alternative method for geospatial inference, SPA in a simulation framework. We highlight the accuracy and limits of continuous spatial assignment methods at various scales by analyzing genotype datasets from a diversity of species, including Florida Scrub-jay birds Aphelocoma coerulescens, Arabidopsis thaliana and humans, representing 41-197,146 SNPs. Our method appears to be best suited for the analysis of medium-sized datasets (a few tens of thousands of loci), such as reduced-representation sequencing data that become increasingly available in ecology. AVAILABILITY AND IMPLEMENTATION: http://www2.imm.dtu.dk/â¼gigu/Spasiba/ CONTACT: gilles.b.guillot@gmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Interpretación Estadística de Datos , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Polimorfismo de Nucleótido Simple , Humanos , Modelos Teóricos , Método de MontecarloRESUMEN
BACKGROUND: Rare missense mutations in the gene encoding coatomer subunit alpha (COPA) have recently been shown to cause autoimmune interstitial lung, joint and kidney disease, also known as COPA syndrome, under a dominant mode of inheritance. CASE PRESENTATION: Here we describe an Icelandic family with three affected individuals over two generations with a rare clinical presentation of lung and joint disease and a histological diagnosis of follicular bronchiolitis. We performed whole-genome sequencing (WGS) of the three affected as well as three unaffected members of the family, and searched for rare genotypes associated with disease using 30,067 sequenced Icelanders as a reference population. We assessed all coding and splicing variants, prioritizing variants in genes known to cause interstitial lung disease. We detected a heterozygous missense mutation, p.Glu241Lys, in the COPA gene, private to the affected family members. The mutation occurred de novo in the paternal germline of the index case and was absent from 30,067 Icelandic genomes and 141,353 individuals from the genome Aggregation Database (gnomAD). The mutation occurs within the conserved and functionally important WD40 domain of the COPA protein. CONCLUSIONS: This is the second report of the p.Glu241Lys mutation in COPA, indicating the recurrent nature of the mutation. The mutation was reported to co-segregate with COPA syndrome in a large family from the USA with five affected members, and classified as pathogenic. The two separate occurrences of the p.Glu241Lys mutation in cases and its absence from a large number of sequenced genomes confirms its role in the pathogenesis of the COPA syndrome.
Asunto(s)
Proteína Coatómero/genética , Síndromes de Inmunodeficiencia/diagnóstico , Síndromes de Inmunodeficiencia/genética , Mutación Missense , Artritis/diagnóstico , Artritis/genética , Niño , Preescolar , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Islandia , Lactante , Enfermedades Pulmonares/diagnóstico , Enfermedades Pulmonares/genética , Masculino , LinajeRESUMEN
Horses, asses, and zebras belong to a single genus, Equus, which emerged 4.0-4.5 Mya. Although the equine fossil record represents a textbook example of evolution, the succession of events that gave rise to the diversity of species existing today remains unclear. Here we present six genomes from each living species of asses and zebras. This completes the set of genomes available for all extant species in the genus, which was hitherto represented only by the horse and the domestic donkey. In addition, we used a museum specimen to characterize the genome of the quagga zebra, which was driven to extinction in the early 1900s. We scan the genomes for lineage-specific adaptations and identify 48 genes that have evolved under positive selection and are involved in olfaction, immune response, development, locomotion, and behavior. Our extensive genome dataset reveals a highly dynamic demographic history with synchronous expansions and collapses on different continents during the last 400 ky after major climatic events. We show that the earliest speciation occurred with gene flow in Northern America, and that the ancestor of present-day asses and zebras dispersed into the Old World 2.1-3.4 Mya. Strikingly, we also find evidence for gene flow involving three contemporary equine species despite chromosomal numbers varying from 16 pairs to 31 pairs. These findings challenge the claim that the accumulation of chromosomal rearrangements drive complete reproductive isolation, and promote equids as a fundamental model for understanding the interplay between chromosomal structure, gene flow, and, ultimately, speciation.
Asunto(s)
Cromosomas de los Mamíferos/genética , Equidae/genética , Evolución Molecular , Extinción Biológica , Flujo Génico , África , Animales , América del NorteRESUMEN
The domestication of the horse â¼ 5.5 kya and the emergence of mounted riding, chariotry, and cavalry dramatically transformed human civilization. However, the genetics underlying horse domestication are difficult to reconstruct, given the near extinction of wild horses. We therefore sequenced two ancient horse genomes from Taymyr, Russia (at 7.4- and 24.3-fold coverage), both predating the earliest archeological evidence of domestication. We compared these genomes with genomes of domesticated horses and the wild Przewalski's horse and found genetic structure within Eurasia in the Late Pleistocene, with the ancient population contributing significantly to the genetic variation of domesticated breeds. We furthermore identified a conservative set of 125 potential domestication targets using four complementary scans for genes that have undergone positive selection. One group of genes is involved in muscular and limb development, articular junctions, and the cardiac system, and may represent physiological adaptations to human utilization. A second group consists of genes with cognitive functions, including social behavior, learning capabilities, fear response, and agreeableness, which may have been key for taming horses. We also found that domestication is associated with inbreeding and an excess of deleterious mutations. This genetic load is in line with the "cost of domestication" hypothesis also reported for rice, tomatoes, and dogs, and it is generally attributed to the relaxation of purifying selection resulting from the strong demographic bottlenecks accompanying domestication. Our work demonstrates the power of ancient genomes to reconstruct the complex genetic changes that transformed wild animals into their domesticated forms, and the population context in which this process took place.
Asunto(s)
Animales Domésticos/genética , Evolución Molecular , Genoma/fisiología , Caballos/genética , Animales , Sistema Cardiovascular/anatomía & histología , Perros , Miembro Posterior/anatomía & histología , Miembro Posterior/fisiología , Caballos/anatomía & histología , Humanos , Endogamia , Federación de RusiaRESUMEN
MOTIVATION: Ancient DNA (aDNA) molecules in fossilized bones and teeth, coprolites, sediments, mummified specimens and museum collections represent fantastic sources of information for evolutionary biologists, revealing the agents of past epidemics and the dynamics of past populations. However, the analysis of aDNA generally faces two major issues. Firstly, sequences consist of a mixture of endogenous and various exogenous backgrounds, mostly microbial. Secondly, high nucleotide misincorporation rates can be observed as a result of severe post-mortem DNA damage. Such misincorporation patterns are instrumental to authenticate ancient sequences versus modern contaminants. We recently developed the user-friendly mapDamage package that identifies such patterns from next-generation sequencing (NGS) sequence datasets. The absence of formal statistical modeling of the DNA damage process, however, precluded rigorous quantitative comparisons across samples. RESULTS: Here, we describe mapDamage 2.0 that extends the original features of mapDamage by incorporating a statistical model of DNA damage. Assuming that damage events depend only on sequencing position and post-mortem deamination, our Bayesian statistical framework provides estimates of four key features of aDNA molecules: the average length of overhangs (λ), nick frequency (ν) and cytosine deamination rates in both double-stranded regions ( ) and overhangs ( ). Our model enables rescaling base quality scores according to their probability of being damaged. mapDamage 2.0 handles NGS datasets with ease and is compatible with a wide range of DNA library protocols. AVAILABILITY: mapDamage 2.0 is available at ginolhac.github.io/mapDamage/ as a Python package and documentation is maintained at the Centre for GeoGenetics Web site (geogenetics.ku.dk/publications/mapdamage2.0/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Daño del ADN , Secuenciación de Nucleótidos de Alto Rendimiento , Análisis de Secuencia de ADN , Programas Informáticos , Teorema de Bayes , Citosina/metabolismo , Desaminación , Fósiles , HumanosRESUMEN
Marfan syndrome (MFS) is an autosomal dominant condition characterized by aortic aneurysm, skeletal abnormalities, and lens dislocation, and is caused by variants in the FBN1 gene. To explore causes of MFS and the prevalence of the disease in Iceland we collected information from all living individuals with a clinical diagnosis of MFS in Iceland (n = 32) and performed whole-genome sequencing of those who did not have a confirmed genetic diagnosis (27/32). Moreover, to assess a potential underdiagnosis of MFS in Iceland we attempted a genotype-based approach to identify individuals with MFS. We interrogated deCODE genetics' database of 35,712 whole-genome sequenced individuals to search for rare sequence variants in FBN1. Overall, we identified 15 pathogenic or likely pathogenic variants in FBN1 in 44 individuals, only 22 of whom were previously diagnosed with MFS. The most common of these variants, NM_000138.4:c.8038 C > T p.(Arg2680Cys), is present in a multi-generational pedigree, and was found to stem from a single forefather born around 1840. The p.(Arg2680Cys) variant associates with a form of MFS that seems to have an enrichment of abdominal aortic aneurysm, suggesting that this may be a particularly common feature of p.(Arg2680Cys)-associated MFS. Based on these combined genetic and clinical data, we show that MFS prevalence in Iceland could be as high as 1/6,600 in Iceland, compared to 1/10,000 based on clinical diagnosis alone, which indicates underdiagnosis of this actionable genetic disorder.
Asunto(s)
Síndrome de Marfan , Humanos , Síndrome de Marfan/diagnóstico , Síndrome de Marfan/epidemiología , Síndrome de Marfan/genética , Islandia/epidemiología , Fibrilina-1/genética , Genotipo , Linaje , Mutación , Adipoquinas/genéticaRESUMEN
Two-thirds of all human conceptions are lost, in most cases before clinical detection. The lack of detailed understanding of the causes of pregnancy losses constrains focused counseling for future pregnancies. We have previously shown that a missense variant in synaptonemal complex central element protein 2 (SYCE2), in a key residue for the assembly of the synaptonemal complex backbone, associates with recombination traits. Here we show that it also increases risk of pregnancy loss in a genome-wide association analysis on 114,761 women with reported pregnancy loss. We further show that the variant associates with more random placement of crossovers and lower recombination rate in longer chromosomes but higher in the shorter ones. These results support the hypothesis that some pregnancy losses are due to failures in recombination. They further demonstrate that variants with a substantial effect on the quality of recombination can be maintained in the population.
Asunto(s)
Proteínas Nucleares , Complejo Sinaptonémico , Humanos , Femenino , Embarazo , Complejo Sinaptonémico/metabolismo , Proteínas Nucleares/metabolismo , Estudio de Asociación del Genoma Completo , Proteínas Cromosómicas no Histona/metabolismo , Recombinación Genética , MeiosisRESUMEN
Gene promoter and enhancer sequences are bound by transcription factors and are depleted of methylated CpG sites (cytosines preceding guanines in DNA). The absence of methylated CpGs in these sequences typically correlates with increased gene expression, indicating a regulatory role for methylation. We used nanopore sequencing to determine haplotype-specific methylation rates of 15.3 million CpG units in 7,179 whole-blood genomes. We identified 189,178 methylation depleted sequences where three or more proximal CpGs were unmethylated on at least one haplotype. A total of 77,789 methylation depleted sequences (~41%) associated with 80,503 cis-acting sequence variants, which we termed allele-specific methylation quantitative trait loci (ASM-QTLs). RNA sequencing of 896 samples from the same blood draws used to perform nanopore sequencing showed that the ASM-QTL, that is, DNA sequence variability, drives most of the correlation found between gene expression and CpG methylation. ASM-QTLs were enriched 40.2-fold (95% confidence interval 32.2, 49.9) among sequence variants associating with hematological traits, demonstrating that ASM-QTLs are important functional units in the noncoding genome.
Asunto(s)
Islas de CpG , Metilación de ADN , Sitios de Carácter Cuantitativo , Humanos , Regiones Promotoras Genéticas , Haplotipos , Alelos , Regulación de la Expresión Génica , Variación Genética , Secuenciación de Nanoporos/métodos , Genoma HumanoRESUMEN
Age at menopause (AOM) has a substantial impact on fertility and disease risk. While many loci with variants that associate with AOM have been identified through genome-wide association studies (GWAS) under an additive model, other genetic models are rarely considered1. Here through GWAS meta-analysis under the recessive model of 174,329 postmenopausal women from Iceland, Denmark, the United Kingdom (UK; UK Biobank) and Norway, we study low-frequency variants with a large effect on AOM. We discovered that women homozygous for the stop-gain variant rs117316434 (A) in CCDC201 (p.(Arg162Ter), minor allele frequency ~1%) reached menopause 9 years earlier than other women (P = 1.3 × 10-15). The genotype is present in one in 10,000 northern European women and leads to primary ovarian insufficiency in close to half of them. Consequently, homozygotes have fewer children, and the age at last childbirth is 5 years earlier (P = 3.8 × 10-5). The CCDC201 gene was only found in humans in 2022 and is highly expressed in oocytes. Homozygosity for CCDC201 loss-of-function has a substantial impact on female reproductive health, and homozygotes would benefit from reproductive counseling and treatment for symptoms of early menopause.