Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 174
Filter
1.
Cell ; 185(16): 3041-3055.e25, 2022 08 04.
Article in English | MEDLINE | ID: mdl-35917817

ABSTRACT

Rare copy-number variants (rCNVs) include deletions and duplications that occur infrequently in the global human population and can confer substantial risk for disease. In this study, we aimed to quantify the properties of haploinsufficiency (i.e., deletion intolerance) and triplosensitivity (i.e., duplication intolerance) throughout the human genome. We harmonized and meta-analyzed rCNVs from nearly one million individuals to construct a genome-wide catalog of dosage sensitivity across 54 disorders, which defined 163 dosage sensitive segments associated with at least one disorder. These segments were typically gene dense and often harbored dominant dosage sensitive driver genes, which we were able to prioritize using statistical fine-mapping. Finally, we designed an ensemble machine-learning model to predict probabilities of dosage sensitivity (pHaplo & pTriplo) for all autosomal genes, which identified 2,987 haploinsufficient and 1,559 triplosensitive genes, including 648 that were uniquely triplosensitive. This dosage sensitivity resource will provide broad utility for human disease research and clinical genetics.


Subject(s)
DNA Copy Number Variations , Genome, Human , DNA Copy Number Variations/genetics , Gene Dosage , Haploinsufficiency/genetics , Humans
2.
Cell ; 182(1): 189-199.e15, 2020 07 09.
Article in English | MEDLINE | ID: mdl-32531199

ABSTRACT

Structural variants contribute substantially to genetic diversity and are important evolutionarily and medically, but they are still understudied. Here we present a comprehensive analysis of structural variation in the Human Genome Diversity panel, a high-coverage dataset of 911 samples from 54 diverse worldwide populations. We identify, in total, 126,018 variants, 78% of which were not identified in previous global sequencing projects. Some reach high frequency and are private to continental groups or even individual populations, including regionally restricted runaway duplications and putatively introgressed variants from archaic hominins. By de novo assembly of 25 genomes using linked-read sequencing, we discover 1,643 breakpoint-resolved unique insertions, in aggregate accounting for 1.9 Mb of sequence absent from the GRCh38 reference. Our results illustrate the limitation of a single human reference and the need for high-quality genomes from diverse populations to fully discover and understand human genetic variation.


Subject(s)
Genetics, Population , Genomic Structural Variation , Alleles , Databases, Genetic , Gene Dosage , Gene Duplication , Gene Frequency/genetics , Genetic Variation , Genome, Human , Humans
3.
Cell ; 168(5): 830-842.e7, 2017 02 23.
Article in English | MEDLINE | ID: mdl-28235197

ABSTRACT

De novo copy number variants (dnCNVs) arising at multiple loci in a personal genome have usually been considered to reflect cancer somatic genomic instabilities. We describe a multiple dnCNV (MdnCNV) phenomenon in which individuals with genomic disorders carry five to ten constitutional dnCNVs. These CNVs originate from independent formation incidences, are predominantly tandem duplications or complex gains, exhibit breakpoint junction features reminiscent of replicative repair, and show increased de novo point mutations flanking the rearrangement junctions. The active CNV mutation shower appears to be restricted to a transient perizygotic period. We propose that a defect in the CNV formation process is responsible for the "CNV-mutator state," and this state is dampened after early embryogenesis. The constitutional MdnCNV phenomenon resembles chromosomal instability in various cancers. Investigations of this phenomenon may provide unique access to understanding genomic disorders, structural variant mutagenesis, human evolution, and cancer biology.


Subject(s)
Chromosome Aberrations , DNA Copy Number Variations , Genetic Diseases, Inborn/embryology , Genetic Diseases, Inborn/genetics , Genomic Instability , Mutation , Chromosome Breakpoints , Chromosome Duplication , DNA Replication , Embryonic Development , Female , Gametogenesis , Humans , Male
4.
Nature ; 633(8030): 608-614, 2024 Sep.
Article in English | MEDLINE | ID: mdl-39261734

ABSTRACT

Human genetic studies of common variants have provided substantial insight into the biological mechanisms that govern ovarian ageing1. Here we report analyses of rare protein-coding variants in 106,973 women from the UK Biobank study, implicating genes with effects around five times larger than previously found for common variants (ETAA1, ZNF518A, PNPLA8, PALB2 and SAMHD1). The SAMHD1 association reinforces the link between ovarian ageing and cancer susceptibility1, with damaging germline variants being associated with extended reproductive lifespan and increased all-cause cancer risk in both men and women. Protein-truncating variants in ZNF518A are associated with shorter reproductive lifespan-that is, earlier age at menopause (by 5.61 years) and later age at menarche (by 0.56 years). Finally, using 8,089 sequenced trios from the 100,000 Genomes Project (100kGP), we observe that common genetic variants associated with earlier ovarian ageing associate with an increased rate of maternally derived de novo mutations. Although we were unable to replicate the finding in independent samples from the deCODE study, it is consistent with the expected role of DNA damage response genes in maintaining the genetic integrity of germ cells. This study provides evidence of genetic links between age of menopause and cancer risk.


Subject(s)
Aging , Genetic Predisposition to Disease , Menopause , Mutation Rate , Neoplasms , Ovary , Adult , Female , Humans , Male , Middle Aged , Aging/genetics , Aging/pathology , DNA Damage/genetics , Fertility/genetics , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Genome, Human/genetics , Germ-Line Mutation/genetics , Menarche/genetics , Menopause/genetics , Neoplasms/genetics , Ovary/metabolism , Ovary/pathology , Time Factors , UK Biobank , United Kingdom/epidemiology
5.
Nature ; 632(8026): 832-840, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38991538

ABSTRACT

Around 60% of individuals with neurodevelopmental disorders (NDD) remain undiagnosed after comprehensive genetic testing, primarily of protein-coding genes1. Large genome-sequenced cohorts are improving our ability to discover new diagnoses in the non-coding genome. Here we identify the non-coding RNA RNU4-2 as a syndromic NDD gene. RNU4-2 encodes the U4 small nuclear RNA (snRNA), which is a critical component of the U4/U6.U5 tri-snRNP complex of the major spliceosome2. We identify an 18 base pair region of RNU4-2 mapping to two structural elements in the U4/U6 snRNA duplex (the T-loop and stem III) that is severely depleted of variation in the general population, but in which we identify heterozygous variants in 115 individuals with NDD. Most individuals (77.4%) have the same highly recurrent single base insertion (n.64_65insT). In 54 individuals in whom it could be determined, the de novo variants were all on the maternal allele. We demonstrate that RNU4-2 is highly expressed in the developing human brain, in contrast to RNU4-1 and other U4 homologues. Using RNA sequencing, we show how 5' splice-site use is systematically disrupted in individuals with RNU4-2 variants, consistent with the known role of this region during spliceosome activation. Finally, we estimate that variants in this 18 base pair region explain 0.4% of individuals with NDD. This work underscores the importance of non-coding genes in rare disorders and will provide a diagnosis to thousands of individuals with NDD worldwide.


Subject(s)
Mutation , Neurodevelopmental Disorders , RNA, Small Nuclear , Adolescent , Child , Child, Preschool , Female , Humans , Infant , Male , Young Adult , Alleles , Brain/growth & development , Brain/metabolism , Heterozygote , Neurodevelopmental Disorders/genetics , RNA Splice Sites/genetics , RNA, Small Nuclear/genetics , Spliceosomes/genetics , Syndrome , Rare Diseases/genetics , Gene Expression Regulation, Developmental
6.
Nature ; 603(7903): 858-863, 2022 03.
Article in English | MEDLINE | ID: mdl-35322230

ABSTRACT

Genome-wide sequencing of human populations has revealed substantial variation among genes in the intensity of purifying selection acting on damaging genetic variants1. Although genes under the strongest selective constraint are highly enriched for associations with Mendelian disorders, most of these genes are not associated with disease and therefore the nature of the selection acting on them is not known2. Here we show that genetic variants that damage these genes are associated with markedly reduced reproductive success, primarily owing to increased childlessness, with a stronger effect in males than in females. We present evidence that increased childlessness is probably mediated by genetically associated cognitive and behavioural traits, which may mean that male carriers are less likely to find reproductive partners. This reduction in reproductive success may account for 20% of purifying selection against heterozygous variants that ablate protein-coding genes. Although this genetic association may only account for a very minor fraction of the overall likelihood of being childless (less than 1%), especially when compared to more influential sociodemographic factors, it may influence how genes evolve over time.


Subject(s)
Reproduction , Selection, Genetic , Chromosome Mapping , Female , Heterozygote , Humans , Male , Phenotype , Reproduction/genetics
7.
N Engl J Med ; 388(17): 1559-1571, 2023 Apr 27.
Article in English | MEDLINE | ID: mdl-37043637

ABSTRACT

BACKGROUND: Pediatric disorders include a range of highly penetrant, genetically heterogeneous conditions amenable to genomewide diagnostic approaches. Finding a molecular diagnosis is challenging but can have profound lifelong benefits. METHODS: We conducted a large-scale sequencing study involving more than 13,500 families with probands with severe, probably monogenic, difficult-to-diagnose developmental disorders from 24 regional genetics services in the United Kingdom and Ireland. Standardized phenotypic data were collected, and exome sequencing and microarray analyses were performed to investigate novel genetic causes. We developed an iterative variant analysis pipeline and reported candidate variants to clinical teams for validation and diagnostic interpretation to inform communication with families. Multiple regression analyses were performed to evaluate factors affecting the probability of diagnosis. RESULTS: A total of 13,449 probands were included in the analyses. On average, we reported 1.0 candidate variant per parent-offspring trio and 2.5 variants per singleton proband. Using clinical and computational approaches to variant classification, we made a diagnosis in approximately 41% of probands (5502 of 13,449). Of 3599 probands in trios who received a diagnosis by clinical assertion, approximately 76% had a pathogenic de novo variant. Another 22% of probands (2997 of 13,449) had variants of uncertain significance in genes that were strongly linked to monogenic developmental disorders. Recruitment in a parent-offspring trio had the largest effect on the probability of diagnosis (odds ratio, 4.70; 95% confidence interval [CI], 4.16 to 5.31). Probands were less likely to receive a diagnosis if they were born extremely prematurely (i.e., 22 to 27 weeks' gestation; odds ratio, 0.39; 95% CI, 0.22 to 0.68), had in utero exposure to antiepileptic medications (odds ratio, 0.44; 95% CI, 0.29 to 0.67), had mothers with diabetes (odds ratio, 0.52; 95% CI, 0.41 to 0.67), or were of African ancestry (odds ratio, 0.51; 95% CI, 0.31 to 0.78). CONCLUSIONS: Among probands with severe, probably monogenic, difficult-to-diagnose developmental disorders, multimodal analysis of genomewide data had good diagnostic power, even after previous attempts at diagnosis. (Funded by the Health Innovation Challenge Fund and Wellcome Sanger Institute.).


Subject(s)
Genomics , Rare Diseases , Child , Humans , Exome , Ireland/epidemiology , United Kingdom/epidemiology , Rare Diseases/diagnosis , Rare Diseases/epidemiology , Rare Diseases/genetics , Oligonucleotide Array Sequence Analysis , Genetic Association Studies , Neurodevelopmental Disorders/diagnosis , Neurodevelopmental Disorders/genetics , Congenital Abnormalities/diagnosis , Congenital Abnormalities/genetics , Growth Disorders/diagnosis , Growth Disorders/genetics , Facies , Child Behavior Disorders/diagnosis , Child Behavior Disorders/genetics , Genetic Diseases, Inborn/diagnosis , Genetic Diseases, Inborn/genetics
8.
Nature ; 577(7789): 179-189, 2020 01.
Article in English | MEDLINE | ID: mdl-31915397

ABSTRACT

A primary goal of human genetics is to identify DNA sequence variants that influence biomedical traits, particularly those related to the onset and progression of human disease. Over the past 25 years, progress in realizing this objective has been transformed by advances in technology, foundational genomic resources and analytical tools, and by access to vast amounts of genotype and phenotype data. Genetic discoveries have substantially improved our understanding of the mechanisms responsible for many rare and common diseases and driven development of novel preventative and therapeutic strategies. Medical innovation will increasingly focus on delivering care tailored to individual patterns of genetic predisposition.


Subject(s)
Genetic Variation , Animals , Genetic Testing , Genomics , Genotype , Humans , Phenotype , Rare Diseases/genetics
9.
Nature ; 586(7831): 757-762, 2020 10.
Article in English | MEDLINE | ID: mdl-33057194

ABSTRACT

De novo mutations in protein-coding genes are a well-established cause of developmental disorders1. However, genes known to be associated with developmental disorders account for only a minority of the observed excess of such de novo mutations1,2. Here, to identify previously undescribed genes associated with developmental disorders, we integrate healthcare and research exome-sequence data from 31,058 parent-offspring trios of individuals with developmental disorders, and develop a simulation-based statistical test to identify gene-specific enrichment of de novo mutations. We identified 285 genes that were significantly associated with developmental disorders, including 28 that had not previously been robustly associated with developmental disorders. Although we detected more genes associated with developmental disorders, much of the excess of de novo mutations in protein-coding genes remains unaccounted for. Modelling suggests that more than 1,000 genes associated with developmental disorders have not yet been described, many of which are likely to be less penetrant than the currently known genes. Research access to clinical diagnostic datasets will be critical for completing the map of genes associated with developmental disorders.


Subject(s)
DNA Mutational Analysis , Data Analysis , Databases, Genetic , Datasets as Topic , Delivery of Health Care/statistics & numerical data , Developmental Disabilities/genetics , Genetic Diseases, Inborn/genetics , Cohort Studies , DNA Copy Number Variations/genetics , Developmental Disabilities/diagnosis , Europe , Female , Genetic Diseases, Inborn/diagnosis , Germ-Line Mutation/genetics , Haploinsufficiency/genetics , Humans , Male , Mutation, Missense/genetics , Penetrance , Perinatal Death , Sample Size
10.
Am J Hum Genet ; 109(12): 2105-2109, 2022 12 01.
Article in English | MEDLINE | ID: mdl-36459978

ABSTRACT

Synonymous mutations change the DNA sequence of a gene without affecting the amino acid sequence of the encoded protein. Although some synonymous mutations can affect RNA splicing, translational efficiency, and mRNA stability, studies in human genetics, mutagenesis screens, and other experiments and evolutionary analyses have repeatedly shown that most synonymous variants are neutral or only weakly deleterious, with some notable exceptions. Based on a recent study in yeast, there have been claims that synonymous mutations could be as important as nonsynonymous mutations in causing disease, assuming the yeast findings hold up and translate to humans. Here, we argue that there is insufficient evidence to overturn the large, coherent body of knowledge establishing the predominant neutrality of synonymous variants in the human genome.


Subject(s)
Biological Evolution , Saccharomyces cerevisiae , Humans , Mutation/genetics , Amino Acid Sequence , Genome, Human/genetics
11.
Crit Rev Clin Lab Sci ; : 1-24, 2024 Jun 10.
Article in English | MEDLINE | ID: mdl-38855982

ABSTRACT

This scoping review aimed to synthesize the analytical techniques used and methodological limitations encountered when undertaking secondary research using residual neonatal dried blood spot (DBS) samples. Studies that used residual neonatal DBS samples for secondary research (i.e. research not related to newborn screening for inherited genetic and metabolic disorders) were identified from six electronic databases: Cochrane Library, Cumulative Index to Nursing and Allied Health Literature (CINAHL), Embase, Medline, PubMed and Scopus. Inclusion was restricted to studies published from 1973 and written in or translated into English that reported the storage, extraction and testing of neonatal DBS samples. Sixty-seven studies were eligible for inclusion. Included studies were predominantly methodological in nature and measured various analytes, including nucleic acids, proteins, metabolites, environmental pollutants, markers of prenatal substance use and medications. Neonatal DBS samples were stored over a range of temperatures (ambient temperature, cold storage or frozen) and durations (two weeks to 40.5 years), both of which impacted the recovery of some analytes, particularly amino acids, antibodies and environmental pollutants. The size of DBS sample used and potential contamination were also cited as methodological limitations. Residual neonatal DBS samples retained by newborn screening programs are a promising resource for secondary research purposes, with many studies reporting the successful measurement of analytes even from neonatal DBS samples stored for long periods of time in suboptimal temperatures and conditions.

12.
Am J Hum Genet ; 108(11): 2186-2194, 2021 11 04.
Article in English | MEDLINE | ID: mdl-34626536

ABSTRACT

Structural variation (SV) describes a broad class of genetic variation greater than 50 bp in size. SVs can cause a wide range of genetic diseases and are prevalent in rare developmental disorders (DDs). Individuals presenting with DDs are often referred for diagnostic testing with chromosomal microarrays (CMAs) to identify large copy-number variants (CNVs) and/or with single-gene, gene-panel, or exome sequencing (ES) to identify single-nucleotide variants, small insertions/deletions, and CNVs. However, individuals with pathogenic SVs undetectable by conventional analysis often remain undiagnosed. Consequently, we have developed the tool InDelible, which interrogates short-read sequencing data for split-read clusters characteristic of SV breakpoints. We applied InDelible to 13,438 probands with severe DDs recruited as part of the Deciphering Developmental Disorders (DDD) study and discovered 63 rare, damaging variants in genes previously associated with DDs missed by standard SNV, indel, or CNV discovery approaches. Clinical review of these 63 variants determined that about half (30/63) were plausibly pathogenic. InDelible was particularly effective at ascertaining variants between 21 and 500 bp in size and increased the total number of potentially pathogenic variants identified by DDD in this size range by 42.9%. Of particular interest were seven confirmed de novo variants in MECP2, which represent 35.0% of all de novo protein-truncating variants in MECP2 among DDD study participants. InDelible provides a framework for the discovery of pathogenic SVs that are most likely missed by standard analytical workflows and has the potential to improve the diagnostic yield of ES across a broad range of genetic diseases.


Subject(s)
Developmental Disabilities/diagnosis , Developmental Disabilities/genetics , Exome Sequencing/methods , Child , Female , Humans , Male , Methyl-CpG-Binding Protein 2/genetics
13.
Am J Hum Genet ; 108(6): 1083-1094, 2021 06 03.
Article in English | MEDLINE | ID: mdl-34022131

ABSTRACT

Clinical genetic testing of protein-coding regions identifies a likely causative variant in only around half of developmental disorder (DD) cases. The contribution of regulatory variation in non-coding regions to rare disease, including DD, remains very poorly understood. We screened 9,858 probands from the Deciphering Developmental Disorders (DDD) study for de novo mutations in the 5' untranslated regions (5' UTRs) of genes within which variants have previously been shown to cause DD through a dominant haploinsufficient mechanism. We identified four single-nucleotide variants and two copy-number variants upstream of MEF2C in a total of ten individual probands. We developed multiple bespoke and orthogonal experimental approaches to demonstrate that these variants cause DD through three distinct loss-of-function mechanisms, disrupting transcription, translation, and/or protein function. These non-coding region variants represent 23% of likely diagnoses identified in MEF2C in the DDD cohort, but these would all be missed in standard clinical genetics approaches. Nonetheless, these variants are readily detectable in exome sequence data, with 30.7% of 5' UTR bases across all genes well covered in the DDD dataset. Our analyses show that non-coding variants upstream of genes within which coding variants are known to cause DD are an important cause of severe disease and demonstrate that analyzing 5' UTRs can increase diagnostic yield. We also show how non-coding variants can help inform both the disease-causing mechanism underlying protein-coding variants and dosage tolerance of the gene.


Subject(s)
5' Untranslated Regions , Developmental Disabilities/etiology , Genetic Predisposition to Disease , Loss of Function Mutation , Child , Cohort Studies , DNA Copy Number Variations , Developmental Disabilities/pathology , Humans , MEF2 Transcription Factors/genetics , Exome Sequencing
14.
Brain ; 146(11): 4766-4783, 2023 11 02.
Article in English | MEDLINE | ID: mdl-37437211

ABSTRACT

KPTN-related disorder is an autosomal recessive disorder associated with germline variants in KPTN (previously known as kaptin), a component of the mTOR regulatory complex KICSTOR. To gain further insights into the pathogenesis of KPTN-related disorder, we analysed mouse knockout and human stem cell KPTN loss-of-function models. Kptn -/- mice display many of the key KPTN-related disorder phenotypes, including brain overgrowth, behavioural abnormalities, and cognitive deficits. By assessment of affected individuals, we have identified widespread cognitive deficits (n = 6) and postnatal onset of brain overgrowth (n = 19). By analysing head size data from their parents (n = 24), we have identified a previously unrecognized KPTN dosage-sensitivity, resulting in increased head circumference in heterozygous carriers of pathogenic KPTN variants. Molecular and structural analysis of Kptn-/- mice revealed pathological changes, including differences in brain size, shape and cell numbers primarily due to abnormal postnatal brain development. Both the mouse and differentiated induced pluripotent stem cell models of the disorder display transcriptional and biochemical evidence for altered mTOR pathway signalling, supporting the role of KPTN in regulating mTORC1. By treatment in our KPTN mouse model, we found that the increased mTOR signalling downstream of KPTN is rapamycin sensitive, highlighting possible therapeutic avenues with currently available mTOR inhibitors. These findings place KPTN-related disorder in the broader group of mTORC1-related disorders affecting brain structure, cognitive function and network integrity.


Subject(s)
Signal Transduction , TOR Serine-Threonine Kinases , Humans , Animals , Mice , Signal Transduction/genetics , TOR Serine-Threonine Kinases/metabolism , Brain/metabolism , Mechanistic Target of Rapamycin Complex 1/metabolism , Cognition , Microfilament Proteins/genetics
15.
Nature ; 555(7698): 611-616, 2018 03 29.
Article in English | MEDLINE | ID: mdl-29562236

ABSTRACT

We previously estimated that 42% of patients with severe developmental disorders carry pathogenic de novo mutations in coding sequences. The role of de novo mutations in regulatory elements affecting genes associated with developmental disorders, or other genes, has been essentially unexplored. We identified de novo mutations in three classes of putative regulatory elements in almost 8,000 patients with developmental disorders. Here we show that de novo mutations in highly evolutionarily conserved fetal brain-active elements are significantly and specifically enriched in neurodevelopmental disorders. We identified a significant twofold enrichment of recurrently mutated elements. We estimate that, genome-wide, 1-3% of patients without a diagnostic coding variant carry pathogenic de novo mutations in fetal brain-active regulatory elements and that only 0.15% of all possible mutations within highly conserved fetal brain-active elements cause neurodevelopmental disorders with a dominant mechanism. Our findings represent a robust estimate of the contribution of de novo mutations in regulatory elements to this genetically heterogeneous set of disorders, and emphasize the importance of combining functional and evolutionary evidence to identify regulatory causes of genetic disorders.


Subject(s)
Mutation , Neurodevelopmental Disorders/genetics , Regulatory Sequences, Nucleic Acid/genetics , Brain/metabolism , Conserved Sequence , Developmental Disabilities/genetics , Evolution, Molecular , Exome , Female , Fetus/metabolism , Humans , Male
16.
Nature ; 562(7726): 268-271, 2018 10.
Article in English | MEDLINE | ID: mdl-30258228

ABSTRACT

There are thousands of rare human disorders that are caused by single deleterious, protein-coding genetic variants1. However, patients with the same genetic defect can have different clinical presentations2-4, and some individuals who carry known disease-causing variants can appear unaffected5. Here, to understand what explains these differences, we study a cohort of 6,987 children assessed by clinical geneticists to have severe neurodevelopmental disorders such as global developmental delay and autism, often in combination with abnormalities of other organ systems. Although the genetic causes of these neurodevelopmental disorders are expected to be almost entirely monogenic, we show that 7.7% of variance in risk is attributable to inherited common genetic variation. We replicated this genome-wide common variant burden by showing, in an independent sample of 728 trios (comprising a child plus both parents) from the same cohort, that this burden is over-transmitted from parents to children with neurodevelopmental disorders. Our common-variant signal is significantly positively correlated with genetic predisposition to lower educational attainment, decreased intelligence and risk of schizophrenia. We found that common-variant risk was not significantly different between individuals with and without a known protein-coding diagnostic variant, which suggests that common-variant risk affects patients both with and without a monogenic diagnosis. In addition, previously published common-variant scores for autism, height, birth weight and intracranial volume were all correlated with these traits within our cohort, which suggests that phenotypic expression in individuals with monogenic disorders is affected by the same variants as in the general population. Our results demonstrate that common genetic variation affects both overall risk and clinical presentation in neurodevelopmental disorders that are typically considered to be monogenic.


Subject(s)
Genetic Predisposition to Disease , Genetic Variation , Neurodevelopmental Disorders/genetics , Rare Diseases/genetics , Autistic Disorder/genetics , Birth Weight/genetics , Body Height/genetics , Case-Control Studies , Cohort Studies , Developmental Disabilities/genetics , Female , Genome-Wide Association Study , Humans , Intelligence/genetics , Linkage Disequilibrium , Male , Multifactorial Inheritance/genetics , Phenotype , Schizophrenia/genetics
17.
PLoS Genet ; 17(7): e1009679, 2021 07.
Article in English | MEDLINE | ID: mdl-34324492

ABSTRACT

Numerous genetic studies have established a role for rare genomic variants in Congenital Heart Disease (CHD) at the copy number variation (CNV) and de novo variant (DNV) level. To identify novel haploinsufficient CHD disease genes, we performed an integrative analysis of CNVs and DNVs identified in probands with CHD including cases with sporadic thoracic aortic aneurysm. We assembled CNV data from 7,958 cases and 14,082 controls and performed a gene-wise analysis of the burden of rare genomic deletions in cases versus controls. In addition, we performed variation rate testing for DNVs identified in 2,489 parent-offspring trios. Our analysis revealed 21 genes which were significantly affected by rare CNVs and/or DNVs in probands. Fourteen of these genes have previously been associated with CHD while the remaining genes (FEZ1, MYO16, ARID1B, NALCN, WAC, KDM5B and WHSC1) have only been associated in small cases series or show new associations with CHD. In addition, a systems level analysis revealed affected protein-protein interaction networks involved in Notch signaling pathway, heart morphogenesis, DNA repair and cilia/centrosome function. Taken together, this approach highlights the importance of re-analyzing existing datasets to strengthen disease association and identify novel disease genes and pathways.


Subject(s)
DNA Copy Number Variations/genetics , Haploinsufficiency/genetics , Heart Defects, Congenital/genetics , Databases, Genetic , Gene Expression/genetics , Gene Expression Profiling/methods , Genetic Predisposition to Disease/genetics , Genomics/methods , Humans , Ion Channels/genetics , Membrane Proteins/genetics , Polymorphism, Single Nucleotide/genetics , Transcriptome/genetics
19.
Nature ; 543(7647): 714-718, 2017 03 30.
Article in English | MEDLINE | ID: mdl-28329761

ABSTRACT

Somatic cells acquire mutations throughout the course of an individual's life. Mutations occurring early in embryogenesis are often present in a substantial proportion of, but not all, cells in postnatal humans and thus have particular characteristics and effects. Depending on their location in the genome and the proportion of cells they are present in, these mosaic mutations can cause a wide range of genetic disease syndromes and predispose carriers to cancer. They have a high chance of being transmitted to offspring as de novo germline mutations and, in principle, can provide insights into early human embryonic cell lineages and their contributions to adult tissues. Although it is known that gross chromosomal abnormalities are remarkably common in early human embryos, our understanding of early embryonic somatic mutations is very limited. Here we use whole-genome sequences of normal blood from 241 adults to identify 163 early embryonic mutations. We estimate that approximately three base substitution mutations occur per cell per cell-doubling event in early human embryogenesis and these are mainly attributable to two known mutational signatures. We used the mutations to reconstruct developmental lineages of adult cells and demonstrate that the two daughter cells of many early embryonic cell-doubling events contribute asymmetrically to adult blood at an approximately 2:1 ratio. This study therefore provides insights into the mutation rates, mutational processes and developmental outcomes of cell dynamics that operate during early human embryogenesis.


Subject(s)
Embryo, Mammalian/cytology , Embryo, Mammalian/metabolism , Embryonic Development/genetics , Mutation , Adult , Blood Cells/metabolism , Cell Lineage/genetics , Genome, Human/genetics , Germ-Line Mutation/genetics , Humans , Mosaicism , Mutagenesis , Mutation Rate
20.
Hum Mutat ; 43(6): 682-697, 2022 06.
Article in English | MEDLINE | ID: mdl-35143074

ABSTRACT

DECIPHER (https://www.deciphergenomics.org) is a free web platform for sharing anonymized phenotype-linked variant data from rare disease patients. Its dynamic interpretation interfaces contextualize genomic and phenotypic data to enable more informed variant interpretation, incorporating international standards for variant classification. DECIPHER supports almost all types of germline and mosaic variation in the nuclear and mitochondrial genome: sequence variants, short tandem repeats, copy-number variants, and large structural variants. Patient phenotypes are deposited using Human Phenotype Ontology (HPO) terms, supplemented by quantitative data, which is aggregated to derive gene-specific phenotypic summaries. It hosts data from >250 projects from ~40 countries, openly sharing >40,000 patient records containing >51,000 variants and >172,000 phenotype terms. The rich phenotype-linked variant data in DECIPHER drives rare disease research and diagnosis by enabling patient matching within DECIPHER and with other resources, and has been cited in >2,600 publications. In this study, we describe the types of data deposited to DECIPHER, the variant interpretation tools, and patient matching interfaces which make DECIPHER an invaluable rare disease resource.


Subject(s)
Databases, Genetic , Rare Diseases , Genomics , Humans , Phenotype , Rare Diseases/diagnosis , Rare Diseases/genetics , Software
SELECTION OF CITATIONS
SEARCH DETAIL