RESUMO
To further our understanding of the genetic etiology of autism, we generated and analyzed genome sequence data from 516 idiopathic autism families (2,064 individuals). This resource includes >59 million single-nucleotide variants (SNVs) and 9,212 private copy number variants (CNVs), of which 133,992 and 88 are de novo mutations (DNMs), respectively. We estimate a mutation rate of â¼1.5 × 10-8 SNVs per site per generation with a significantly higher mutation rate in repetitive DNA. Comparing probands and unaffected siblings, we observe several DNM trends. Probands carry more gene-disruptive CNVs and SNVs, resulting in severe missense mutations and mapping to predicted fetal brain promoters and embryonic stem cell enhancers. These differences become more pronounced for autism genes (p = 1.8 × 10-3, OR = 2.2). Patients are more likely to carry multiple coding and noncoding DNMs in different genes, which are enriched for expression in striatal neurons (p = 3 × 10-3), suggesting a path forward for genetically characterizing more complex cases of autism.
Assuntos
Transtorno Autístico/genética , Variações do Número de Cópias de DNA , Polimorfismo de Nucleotídeo Único , Animais , Análise Mutacional de DNA , Feminino , Estudo de Associação Genômica Ampla , Humanos , Mutação INDEL , Masculino , CamundongosRESUMO
During mammalian development, differences in chromatin state coincide with cellular differentiation and reflect changes in the gene regulatory landscape1. In the developing brain, cell fate specification and topographic identity are important for defining cell identity2 and confer selective vulnerabilities to neurodevelopmental disorders3. Here, to identify cell-type-specific chromatin accessibility patterns in the developing human brain, we used a single-cell assay for transposase accessibility by sequencing (scATAC-seq) in primary tissue samples from the human forebrain. We applied unbiased analyses to identify genomic loci that undergo extensive cell-type- and brain-region-specific changes in accessibility during neurogenesis, and an integrative analysis to predict cell-type-specific candidate regulatory elements. We found that cerebral organoids recapitulate most putative cell-type-specific enhancer accessibility patterns but lack many cell-type-specific open chromatin regions that are found in vivo. Systematic comparison of chromatin accessibility across brain regions revealed unexpected diversity among neural progenitor cells in the cerebral cortex and implicated retinoic acid signalling in the specification of neuronal lineage identity in the prefrontal cortex. Together, our results reveal the important contribution of chromatin state to the emerging patterns of cell type diversity and cell fate specification and provide a blueprint for evaluating the fidelity and robustness of cerebral organoids as a model for cortical development.
Assuntos
Encéfalo/citologia , Epigenômica , Neurogênese , Análise de Célula Única , Atlas como Assunto , Encéfalo/crescimento & desenvolvimento , Encéfalo/metabolismo , Cromatina/química , Cromatina/genética , Cromatina/metabolismo , Suscetibilidade a Doenças , Elementos Facilitadores Genéticos , Humanos , Neurônios/citologia , Neurônios/metabolismo , Organoides/citologia , Tretinoína/metabolismoRESUMO
Single nucleotide variants in the general population are common genomic alterations, where the majority are presumed to be silent polymorphisms without known clinical significance. Using human induced pluripotent stem cell (hiPSC) cerebral organoid modeling of the 1.4 megabase Neurofibromatosis type 1 (NF1) deletion syndrome, we previously discovered that the cytokine receptor-like factor-3 (CRLF3) gene, which is co-deleted with the NF1 gene, functions as a major regulator of neuronal maturation. Moreover, children with NF1 and the CRLF3L389P variant have greater autism burden, suggesting that this gene might be important for neurologic function. To explore the functional consequences of this variant, we generated CRLF3L389P-mutant hiPSC lines and Crlf3L389P-mutant genetically engineered mice. While this variant does not impair protein expression, brain structure, or mouse behavior, CRLF3L389P-mutant human cerebral organoids and mouse brains exhibit impaired neuronal maturation and dendrite formation. In addition, Crlf3L389P-mutant mouse neurons have reduced dendrite lengths and branching, without any axonal deficits. Moreover, Crlf3L389P-mutant mouse hippocampal neurons have decreased firing rates and synaptic current amplitudes relative to wild type controls. Taken together, these findings establish the CRLF3L389P variant as functionally deleterious and suggest that it may be a neurodevelopmental disease modifier.
Assuntos
Células-Tronco Pluripotentes Induzidas , Criança , Humanos , Animais , Camundongos , Células-Tronco Pluripotentes Induzidas/metabolismo , Neurônios/metabolismo , Encéfalo/metabolismo , Receptores de Citocinas/metabolismo , Nucleotídeos/metabolismoRESUMO
MOTIVATION: de novo variants (DNVs) are variants that are present in offspring but not in their parents. DNVs are both important for examining mutation rates as well as in the identification of disease-related variation. While efforts have been made to call DNVs, calling of DNVs is still challenging from parent-child sequenced trio data. We developed Hare And Tortoise (HAT) as an automated DNV detection workflow for highly accurate short-read and long-read sequencing data. Reliable detection of DNVs is important for human genomics and HAT addresses this need. RESULTS: HAT is a computational workflow that begins with aligned read data (i.e. CRAM or BAM) from a parent-child sequenced trio and outputs DNVs. HAT detects high-quality DNVs from Illumina short-read whole-exome sequencing, Illumina short-read whole-genome sequencing, and highly accurate PacBio HiFi long-read whole-genome sequencing data. The quality of these DNVs is high based on a series of quality metrics including number of DNVs per individual, percent of DNVs at CpG sites, and percent of DNVs phased to the paternal chromosome of origin. AVAILABILITY AND IMPLEMENTATION: https://github.com/TNTurnerLab/HAT.
Assuntos
Lebres , Tartarugas , Animais , Humanos , Tartarugas/genética , Lebres/genética , Exoma , Genoma Humano , Sequenciamento Completo do Genoma , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNARESUMO
Chiari I malformation (CM1), the displacement of the cerebellum through the foramen magnum into the spinal canal, is one of the most common pediatric neurological conditions. Individuals with CM1 can present with neurological symptoms, including severe headaches and sensory or motor deficits, often as a consequence of brainstem compression or syringomyelia (SM). We conducted whole-exome sequencing (WES) on 668 CM1 probands and 232 family members and performed gene-burden and de novo enrichment analyses. A significant enrichment of rare and de novo non-synonymous variants in chromodomain (CHD) genes was observed among individuals with CM1 (combined p = 2.4 × 10-10), including 3 de novo loss-of-function variants in CHD8 (LOF enrichment p = 1.9 × 10-10) and a significant burden of rare transmitted variants in CHD3 (p = 1.8 × 10-6). Overall, individuals with CM1 were found to have significantly increased head circumference (p = 2.6 × 10-9), with many harboring CHD rare variants having macrocephaly. Finally, haploinsufficiency for chd8 in zebrafish led to macrocephaly and posterior hindbrain displacement reminiscent of CM1. These results implicate chromodomain genes and excessive brain growth in CM1 pathogenesis.
Assuntos
Malformação de Arnold-Chiari/genética , Proteínas de Ligação a DNA/genética , Polimorfismo de Nucleotídeo Único/genética , Adulto , Animais , Malformação de Arnold-Chiari/patologia , Encéfalo/patologia , Estudos de Casos e Controles , Feminino , Haploinsuficiência/genética , Humanos , Imageamento por Ressonância Magnética/métodos , Masculino , Siringomielia/genética , Sequenciamento do Exoma/métodos , Peixe-Zebra/genéticaRESUMO
The number of de novo mutations (DNMs) in the human germline is correlated with parental age at conception, but this explains only part of the observed variation. We investigated whether there is a family-specific contribution to the number of DNMs in offspring. The analysis of DNMs in 111 dizygotic twin pairs did not identify a substantial family-specific contribution. This result was corroborated by comparing DNMs of 1669 siblings to those of age-matched unrelated offspring following correction for parental age. In addition, by modeling DNM data from 1714 multi-offspring families, we estimated that the family-specific contribution explains â¼5.2% of the variation in DNM number. Furthermore, we found no substantial difference between the observed number of DNMs and those predicted by a stochastic Poisson process. We conclude that there is a small family-specific contribution to DNM number and that stochasticity explains a large proportion of variation in DNM counts.
Assuntos
Células Germinativas , Humanos , MutaçãoRESUMO
BACKGROUND: The study of de novo variation is important for assessing biological characteristics of new variation and for studies related to human phenotypes. Software programs exist to call de novo variants and programs also exist to test the burden of these variants in genomic regions; however, I am unaware of a program that fits in between these two aspects of de novo variant assessment. This intermediate space is important for assessing the quality of de novo variants and to understand the characteristics of the callsets. For this reason, I developed an R package called acorn. RESULTS: Acorn is an R package that examines various features of de novo variants including subsetting the data by individual(s), variant type, or genomic region; calculating features including variant change counts, variant lengths, and presence/absence at CpG sites; and characteristics of parental age in relation to de novo variant counts. CONCLUSIONS: Acorn is an R package that fills a critical gap in assessing de novo variants and will be of benefit to many investigators studying de novo variation.
Assuntos
Genômica , Software , Humanos , FenótipoRESUMO
Detection of de novo variants (DNVs) is critical for studies of disease-related variation and mutation rates. To accelerate DNV calling, we developed a graphics processing units-based workflow. We applied our workflow to whole-genome sequencing data from three parent-child sequenced cohorts including the Simons Simplex Collection (SSC), Simons Foundation Powering Autism Research (SPARK), and the 1000 Genomes Project (1000G) that were sequenced using DNA from blood, saliva, and lymphoblastoid cell lines (LCLs), respectively. The SSC and SPARK DNV callsets were within expectations for number of DNVs, percent at CpG sites, phasing to the paternal chromosome of origin, and average allele balance. However, the 1000G DNV callset was not within expectations and contained excessive DNVs that are likely cell line artifacts. Mutation signature analysis revealed 30% of 1000G DNV signatures matched B-cell lymphoma. Furthermore, we found variants in DNA repair genes and at Clinvar pathogenic or likely-pathogenic sites and significant excess of protein-coding DNVs in IGLL5; a gene known to be involved in B-cell lymphomas. Our study provides a new rapid DNV caller for the field and elucidates important implications of using sequencing data from LCLs for reference building and disease-related projects.
Assuntos
Neoplasias , Humanos , Alelos , Mutação , Neoplasias/genética , Sequenciamento Completo do GenomaRESUMO
While genes with an excess of de novo mutations (DNMs) have been identified in children with neurodevelopmental disorders (NDDs), few studies focus on DNM patterns where the sex of affected children is examined separately. We considered â¼8,825 sequenced parent-child trios (n â¼26,475 individuals) and identify 54 genes with a DNM enrichment in males (n = 18), females (n = 17), or overlapping in both the male and female subsets (n = 19). A replication cohort of 18,778 sequenced parent-child trios (n = 56,334 individuals) confirms 25 genes (n = 3 in males, n = 7 in females, n = 15 in both male and female subsets). As expected, we observe significant enrichment on the X chromosome for females but also find autosomal genes with potential sex bias (females, CDK13, ITPR1; males, CHD8, MBD5, SYNGAP1); 6.5% of females harbor a DNM in a female-enriched gene, whereas 2.7% of males have a DNM in a male-enriched gene. Sex-biased genes are enriched in transcriptional processes and chromatin binding, primarily reside in the nucleus of cells, and have brain expression. By downsampling, we find that DNM gene discovery is greatest when studying affected females. Finally, directly comparing de novo allele counts in NDD-affected males and females identifies one replicated genome-wide significant gene (DDX3X) with locus-specific enrichment in females. Our sex-based DNM enrichment analysis identifies candidate NDD genes differentially affecting males and females and indicates that the study of females with NDDs leads to greater gene discovery consistent with the female-protective effect.
Assuntos
Exoma/genética , Marcadores Genéticos , Mutação , Transtornos do Neurodesenvolvimento/genética , Criança , Estudos de Coortes , Feminino , Redes Reguladoras de Genes , Estudo de Associação Genômica Ampla , Humanos , Masculino , Transtornos do Neurodesenvolvimento/patologia , Fenótipo , Fatores SexuaisRESUMO
BACKGROUND: Hirschsprung's disease, or congenital aganglionosis, is a developmental disorder of the enteric nervous system and is the most common cause of intestinal obstruction in neonates and infants. The disease has more than 80% heritability, including significant associations with rare and common sequence variants in genes related to the enteric nervous system, as well as with monogenic and chromosomal syndromes. METHODS: We genotyped and exome-sequenced samples from 190 patients with Hirschsprung's disease to quantify the genetic burden in patients with this condition. DNA sequence variants, large copy-number variants, and karyotype variants in probands were considered to be pathogenic when they were significantly associated with Hirschsprung's disease or another neurodevelopmental disorder. Novel genes were confirmed by functional studies in the mouse and human embryonic gut and in zebrafish embryos. RESULTS: The presence of five or more variants in four noncoding elements defined a widespread risk of Hirschsprung's disease (48.4% of patients and 17.1% of controls; odds ratio, 4.54; 95% confidence interval [CI], 3.19 to 6.46). Rare coding variants in 24 genes that play roles in enteric neural-crest cell fate, 7 of which were novel, were also common (34.7% of patients and 5.0% of controls) and conferred a much greater risk than noncoding variants (odds ratio, 10.02; 95% CI, 6.45 to 15.58). Large copy-number variants, which were present in fewer patients (11.4%, as compared with 0.2% of controls), conferred the highest risk (odds ratio, 63.07; 95% CI, 36.75 to 108.25). At least one identifiable genetic risk factor was found in 72.1% of the patients, and at least 48.4% of patients had a structural or regulatory deficiency in the gene encoding receptor tyrosine kinase (RET). For individual patients, the estimated risk of Hirschsprung's disease ranged from 5.33 cases per 100,000 live births (approximately 1 per 18,800) to 8.38 per 1000 live births (approximately 1 per 120). CONCLUSIONS: Among the patients in our study, Hirschsprung's disease arose from common noncoding variants, rare coding variants, and copy-number variants affecting genes involved in enteric neural-crest cell fate that exacerbate the widespread genetic susceptibility associated with RET. For individual patients, the genotype-specific odds ratios varied by a factor of approximately 67, which provides a basis for risk stratification and genetic counseling. (Funded by the National Institutes of Health.).
Assuntos
Variação Genética , Genótipo , Doença de Hirschsprung/genética , Exoma , Feminino , Predisposição Genética para Doença , Humanos , Masculino , Mutação , Razão de Chances , Penetrância , Análise de Sequência de DNA , Sequenciamento do ExomaRESUMO
MOTIVATION: An abundance of new reference genomes is becoming available through large-scale sequencing efforts. While the reference FASTA for each genome is available, there is currently no automated mechanism to query a specific sequence across all new reference genomes. RESULTS: We developed ACES (Analysis of Conservation with an Extensive list of Species) as a computational workflow to query specific sequences of interest (e.g. enhancers, promoters, exons) against reference genomes with an available reference FASTA. This automated workflow generates BLAST hits against each of the reference genomes, a multiple sequence alignment file, a graphical fragment assembly file and a phylogenetic tree file. These data files can then be used by the researcher in several ways to provide key insights into conservation of the query sequence. AVAILABILITY AND IMPLEMENTATION: ACES is available at https://github.com/TNTurnerLab/ACES. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Genoma , Software , Filogenia , Alinhamento de Sequência , ÉxonsRESUMO
BACKGROUND: Previous research in autism and other neurodevelopmental disorders (NDDs) has indicated an important contribution of protein-coding (coding) de novo variants (DNVs) within specific genes. The role of de novo noncoding variation has been observable as a general increase in genetic burden but has yet to be resolved to individual functional elements. In this study, we assessed whole-genome sequencing data in 2671 families with autism (discovery cohort of 516 families, replication cohort of 2155 families). We focused on DNVs in enhancers with characterized in vivo activity in the brain and identified an excess of DNVs in an enhancer named hs737. RESULTS: We adapted the fitDNM statistical model to work in noncoding regions and tested enhancers for excess of DNVs in families with autism. We found only one enhancer (hs737) with nominal significance in the discovery (p = 0.0172), replication (p = 2.5 × 10-3), and combined dataset (p = 1.1 × 10-4). Each individual with a DNV in hs737 had shared phenotypes including being male, intact cognitive function, and hypotonia or motor delay. Our in vitro assessment of the DNVs showed they all reduce enhancer activity in a neuronal cell line. By epigenomic analyses, we found that hs737 is brain-specific and targets the transcription factor gene EBF3 in human fetal brain. EBF3 is genome-wide significant for coding DNVs in NDDs (missense p = 8.12 × 10-35, loss-of-function p = 2.26 × 10-13) and is widely expressed in the body. Through characterization of promoters bound by EBF3 in neuronal cells, we saw enrichment for binding to NDD genes (p = 7.43 × 10-6, OR = 1.87) involved in gene regulation. Individuals with coding DNVs have greater phenotypic severity (hypotonia, ataxia, and delayed development syndrome [HADDS]) in comparison to individuals with noncoding DNVs that have autism and hypotonia. CONCLUSIONS: In this study, we identify DNVs in the hs737 enhancer in individuals with autism. Through multiple approaches, we find hs737 targets the gene EBF3 that is genome-wide significant in NDDs. By assessment of noncoding variation and the genes they affect, we are beginning to understand their impact on gene regulatory networks in NDDs.
Assuntos
Transtorno Autístico/genética , Predisposição Genética para Doença , Hipotonia Muscular/genética , Transtornos do Neurodesenvolvimento/genética , Fatores de Transcrição/genética , Transtorno Autístico/epidemiologia , Transtorno Autístico/patologia , Elementos Facilitadores Genéticos/genética , Exoma/genética , Feminino , Redes Reguladoras de Genes/genética , Humanos , Masculino , Hipotonia Muscular/epidemiologia , Hipotonia Muscular/patologia , Mutação/genética , Transtornos do Neurodesenvolvimento/epidemiologia , Transtornos do Neurodesenvolvimento/patologia , Neurônios/metabolismo , Neurônios/patologiaRESUMO
Currently, protein-coding de novo variants and large copy number variants have been identified as important for ~30% of individuals with autism. One approach to identify relevant variation in individuals who lack these types of events is by utilizing newer genomic technologies. In this study, highly accurate PacBio HiFi long-read sequencing was applied to a family with autism, epileptic encephalopathy, cognitive impairment, and mild dysmorphic features (two affected female siblings, unaffected parents, and one unaffected male sibling) with no known clinical variant. From our long-read sequencing data, a de novo missense variant in the KCNC2 gene (encodes Kv3.2) was identified in both affected children. This variant was phased to the paternal chromosome of origin and is likely a germline mosaic. In silico assessment revealed the variant was not in controls, highly conserved, and predicted damaging. This specific missense variant (Val473Ala) has been shown in both an ortholog and paralog of Kv3.2 to accelerate current decay, shift the voltage dependence of activation, and prevent the channel from entering a long-lasting open state. Seven additional missense variants have been identified in other individuals with neurodevelopmental disorders (p = 1.03 × 10-5 ). KCNC2 is most highly expressed in the brain; in particular, in the thalamus and is enriched in GABAergic neurons. Long-read sequencing was useful in discovering the relevant variant in this family with autism that had remained a mystery for several years and will potentially have great benefits in the clinic once it is widely available.
Assuntos
Transtorno Autístico , Epilepsia , Canais de Potássio Shaw , Transtorno Autístico/genética , Criança , Epilepsia/genética , Feminino , Células Germinativas , Humanos , Masculino , Mosaicismo , Mutação de Sentido Incorreto , Canais de Potássio Shaw/genéticaRESUMO
BACKGROUND: Copy number variants (CNVs) linked to genes involved in nervous system development or function are often associated with neuropsychiatric disease. While CNVs involving deletions generally cause severe and highly penetrant patient phenotypes, CNVs leading to duplications tend instead to exhibit widely variable and less penetrant phenotypic expressivity among affected individuals. CNVs located on chromosome 15q13.3 affecting the alpha-7 nicotinic acetylcholine receptor subunit (CHRNA7) gene contribute to multiple neuropsychiatric disorders with highly variable penetrance. However, the basis of such differential penetrance remains uncharacterized. Here, we generated induced pluripotent stem cell (iPSC) models from first-degree relatives with a 15q13.3 duplication and analyzed their cellular phenotypes to uncover a basis for the dissimilar phenotypic expressivity. RESULTS: The first-degree relatives studied included a boy with autism and emotional dysregulation (the affected proband-AP) and his clinically unaffected mother (UM), with comparison to unrelated control models lacking this duplication. Potential contributors to neuropsychiatric impairment were modeled in iPSC-derived cortical excitatory and inhibitory neurons. The AP-derived model uniquely exhibited disruptions of cellular physiology and neurodevelopment not observed in either the UM or unrelated controls. These included enhanced neural progenitor proliferation but impaired neuronal differentiation, maturation, and migration, and increased endoplasmic reticulum (ER) stress. Both the neuronal migration deficit and elevated ER stress could be selectively rescued by different pharmacologic agents. Neuronal gene expression was also dysregulated in the AP, including reduced expression of genes related to behavior, psychological disorders, neuritogenesis, neuronal migration, and Wnt, axonal guidance, and GABA receptor signaling. The UM model instead exhibited upregulated expression of genes in many of these same pathways, suggesting that molecular compensation could have contributed to the lack of neurodevelopmental phenotypes in this model. However, both AP- and UM-derived neurons exhibited shared alterations of neuronal function, including increased action potential firing and elevated cholinergic activity, consistent with increased homomeric CHRNA7 channel activity. CONCLUSIONS: These data define both diagnosis-associated cellular phenotypes and shared functional anomalies related to CHRNA7 duplication that may contribute to variable phenotypic penetrance in individuals with 15q13.3 duplication. The capacity for pharmacological agents to rescue some neurodevelopmental anomalies associated with diagnosis suggests avenues for intervention for carriers of this duplication and other CNVs that cause related disorders.
Assuntos
Cromossomos Humanos Par 15 , Variações do Número de Cópias de DNA , Receptor Nicotínico de Acetilcolina alfa7/genética , Cromossomos Humanos Par 15/genética , Humanos , Masculino , Neurônios , FenótipoRESUMO
Autism is a multifactorial neurodevelopmental disorder affecting more males than females; consequently, under a multifactorial genetic hypothesis, females are affected only when they cross a higher biological threshold. We hypothesize that deleterious variants at conserved residues are enriched in severely affected patients arising from female-enriched multiplex families with severe disease, enhancing the detection of key autism genes in modest numbers of cases. Here we show the use of this strategy by identifying missense and dosage sequence variants in the gene encoding the adhesive junction-associated δ-catenin protein (CTNND2) in female-enriched multiplex families and demonstrating their loss-of-function effect by functional analyses in zebrafish embryos and cultured hippocampal neurons from wild-type and Ctnnd2 null mouse embryos. Finally, through gene expression and network analyses, we highlight a critical role for CTNND2 in neuronal development and an intimate connection to chromatin biology. Our data contribute to the understanding of the genetic architecture of autism and suggest that genetic analyses of phenotypic extremes, such as female-enriched multiplex families, are of innate value in multifactorial disorders.
Assuntos
Transtorno Autístico/genética , Transtorno Autístico/metabolismo , Encéfalo/metabolismo , Cateninas/deficiência , Cateninas/genética , Animais , Encéfalo/embriologia , Cateninas/metabolismo , Células Cultivadas , Cromatina/genética , Cromatina/metabolismo , Variações do Número de Cópias de DNA/genética , Embrião de Mamíferos/citologia , Embrião de Mamíferos/metabolismo , Exoma/genética , Feminino , Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Hipocampo/patologia , Humanos , Masculino , Camundongos , Modelos Genéticos , Herança Multifatorial/genética , Mutação de Sentido Incorreto , Rede Nervosa , Neurônios/citologia , Neurônios/metabolismo , Caracteres Sexuais , Peixe-Zebra/embriologia , Peixe-Zebra/genética , Peixe-Zebra/metabolismo , delta CateninaRESUMO
We performed whole-genome sequencing (WGS) of 208 genomes from 53 families affected by simplex autism. For the majority of these families, no copy-number variant (CNV) or candidate de novo gene-disruptive single-nucleotide variant (SNV) had been detected by microarray or whole-exome sequencing (WES). We integrated multiple CNV and SNV analyses and extensive experimental validation to identify additional candidate mutations in eight families. We report that compared to control individuals, probands showed a significant (p = 0.03) enrichment of de novo and private disruptive mutations within fetal CNS DNase I hypersensitive sites (i.e., putative regulatory regions). This effect was only observed within 50 kb of genes that have been previously associated with autism risk, including genes where dosage sensitivity has already been established by recurrent disruptive de novo protein-coding mutations (ARID1B, SCN2A, NR3C2, PRKCA, and DSCAM). In addition, we provide evidence of gene-disruptive CNVs (in DISC1, WNT7A, RBFOX1, and MBD5), as well as smaller de novo CNVs and exon-specific SNVs missed by exome sequencing in neurodevelopmental genes (e.g., CANX, SAE1, and PIK3CA). Our results suggest that the detection of smaller, often multiple CNVs affecting putative regulatory elements might help explain additional risk of simplex autism.
Assuntos
Transtorno Autístico/genética , DNA/genética , Genoma Humano , Exoma , Feminino , Humanos , Masculino , Linhagem , Polimorfismo de Nucleotídeo ÚnicoRESUMO
PURPOSE: To maximize the discovery of potentially pathogenic variants to better understand the diagnostic utility of genome sequencing (GS) and to assess how the presence of multiple risk events might affect the phenotypic severity in autism spectrum disorders (ASD). METHODS: GS was applied to 180 simplex and multiplex ASD families (578 individuals, 213 patients) with exome sequencing and array comparative genomic hybridization further applied to a subset for validation and cross-platform comparisons. RESULTS: We found that 40.8% of patients carried variants with evidence of disease risk, including a de novo frameshift variant in NR4A2 and two de novo missense variants in SYNCRIP, while 21.1% carried clinically relevant pathogenic or likely pathogenic variants. Patients with more than one risk variant (9.9%) were more severely affected with respect to cognitive ability compared with patients with a single or no-risk variant. We observed no instance among the 27 multiplex families where a pathogenic or likely pathogenic variant was transmitted to all affected members in the family. CONCLUSION: The study demonstrates the diagnostic utility of GS, especially for multiple risk variants that contribute to the phenotypic severity, shows the genetic heterogeneity in multiplex families, and provides evidence for new genes for follow up.
Assuntos
Transtorno Autístico/genética , Sequenciamento do Exoma , Criança , Hibridização Genômica Comparativa , Variações do Número de Cópias de DNA , Análise Mutacional de DNA , Feminino , Humanos , Masculino , FenótipoRESUMO
Whole-exome and whole-genome sequencing have facilitated the large-scale discovery of de novo variants in human disease. To date, most de novo discovery through next-generation sequencing focused on congenital heart disease and neurodevelopmental disorders (NDDs). Currently, de novo variants are one of the most significant risk factors for NDDs with a substantial overlap of genes involved in more than one NDD. To facilitate better usage of published data, provide standardization of annotation, and improve accessibility, we created denovo-db (http://denovo-db.gs.washington.edu), a database for human de novo variants. As of July 2016, denovo-db contained 40 different studies and 32,991 de novo variants from 23,098 trios. Database features include basic variant information (chromosome location, change, type); detailed annotation at the transcript and protein levels; severity scores; frequency; validation status; and, most importantly, the phenotype of the individual with the variant. We included a feature on our browsable website to download any query result, including a downloadable file of the full database with additional variant details. denovo-db provides necessary information for researchers to compare their data to other individuals with the same phenotype and also to controls allowing for a better understanding of the biology of de novo variants and their contribution to disease.