RESUMO
Fewer than half of individuals with a suspected Mendelian or monogenic condition receive a precise molecular diagnosis after comprehensive clinical genetic testing. Improvements in data quality and costs have heightened interest in using long-read sequencing (LRS) to streamline clinical genomic testing, but the absence of control data sets for variant filtering and prioritization has made tertiary analysis of LRS data challenging. To address this, the 1000 Genomes Project (1KGP) Oxford Nanopore Technologies Sequencing Consortium aims to generate LRS data from at least 800 of the 1KGP samples. Our goal is to use LRS to identify a broader spectrum of variation so we may improve our understanding of normal patterns of human variation. Here, we present data from analysis of the first 100 samples, representing all 5 superpopulations and 19 subpopulations. These samples, sequenced to an average depth of coverage of 37× and sequence read N50 of 54 kbp, have high concordance with previous studies for identifying single nucleotide and indel variants outside of homopolymer regions. Using multiple structural variant (SV) callers, we identify an average of 24,543 high-confidence SVs per genome, including shared and private SVs likely to disrupt gene function as well as pathogenic expansions within disease-associated repeats that were not detected using short reads. Evaluation of methylation signatures revealed expected patterns at known imprinted loci, samples with skewed X-inactivation patterns, and novel differentially methylated regions. All raw sequencing data, processed data, and summary statistics are publicly available, providing a valuable resource for the clinical genetics community to discover pathogenic SVs.
RESUMO
The opportunistic fungal pathogen of humans Candida albicans is able to grow in different morphological forms such as round or oval yeasts and filamentous hyphae and pseudohyphae. Morphogenesis, the ability to switch between the yeast and filamentous growth forms, is important for adapting to new microenvironments in the human host and for pathogenesis. The molecular pathways governing morphogenesis are complex and incompletely understood. Previously, we identified several small organic molecules that specifically inhibit the initiation of hyphal growth in C. albicans without affecting cell viability or budded growth. One molecule from that screen is known to induce apoptosis in mammalian cells. In this study, we have screened additional inducers of mammalian apoptosis and identified BH3I-1, as well as several structural derivatives of BH3I-1, that act as specific inhibitors of morphogenesis under a variety of environmental conditions. Chemical epistasis experiments suggest that BH3I-1 acts downstream of the hypha-specific gene regulators Rfg1, Nrg1 and Ume6.
Assuntos
Candida albicans/efeitos dos fármacos , Tiazóis/farmacologia , Biologia Computacional , Hifas/efeitos dos fármacos , Hifas/crescimento & desenvolvimento , Tiazóis/química , TiazolidinedionasRESUMO
Less than half of individuals with a suspected Mendelian condition receive a precise molecular diagnosis after comprehensive clinical genetic testing. Improvements in data quality and costs have heightened interest in using long-read sequencing (LRS) to streamline clinical genomic testing, but the absence of control datasets for variant filtering and prioritization has made tertiary analysis of LRS data challenging. To address this, the 1000 Genomes Project ONT Sequencing Consortium aims to generate LRS data from at least 800 of the 1000 Genomes Project samples. Our goal is to use LRS to identify a broader spectrum of variation so we may improve our understanding of normal patterns of human variation. Here, we present data from analysis of the first 100 samples, representing all 5 superpopulations and 19 subpopulations. These samples, sequenced to an average depth of coverage of 37x and sequence read N50 of 54 kbp, have high concordance with previous studies for identifying single nucleotide and indel variants outside of homopolymer regions. Using multiple structural variant (SV) callers, we identify an average of 24,543 high-confidence SVs per genome, including shared and private SVs likely to disrupt gene function as well as pathogenic expansions within disease-associated repeats that were not detected using short reads. Evaluation of methylation signatures revealed expected patterns at known imprinted loci, samples with skewed X-inactivation patterns, and novel differentially methylated regions. All raw sequencing data, processed data, and summary statistics are publicly available, providing a valuable resource for the clinical genetics community to discover pathogenic SVs.
RESUMO
Sequence-based genetic testing identifies causative variants in ~ 50% of individuals with developmental and epileptic encephalopathies (DEEs). Aberrant changes in DNA methylation are implicated in various neurodevelopmental disorders but remain unstudied in DEEs. We interrogate the diagnostic utility of genome-wide DNA methylation array analysis on peripheral blood samples from 582 individuals with genetically unsolved DEEs. We identify rare differentially methylated regions (DMRs) and explanatory episignatures to uncover causative and candidate genetic etiologies in 12 individuals. Using long-read sequencing, we identify DNA variants underlying rare DMRs, including one balanced translocation, three CG-rich repeat expansions, and four copy number variants. We also identify pathogenic variants associated with episignatures. Finally, we refine the CHD2 episignature using an 850 K methylation array and bisulfite sequencing to investigate potential insights into CHD2 pathophysiology. Our study demonstrates the diagnostic yield of genome-wide DNA methylation analysis to identify causal and candidate variants as 2% (12/582) for unsolved DEE cases.
Assuntos
Variações do Número de Cópias de DNA , Metilação de DNA , Epilepsia , Humanos , Metilação de DNA/genética , Feminino , Criança , Masculino , Epilepsia/genética , Epilepsia/diagnóstico , Variações do Número de Cópias de DNA/genética , Pré-Escolar , Proteínas de Ligação a DNA/genética , Adolescente , Testes Genéticos/métodos , LactenteRESUMO
Sequence-based genetic testing currently identifies causative genetic variants in â¼50% of individuals with developmental and epileptic encephalopathies (DEEs). Aberrant changes in DNA methylation are implicated in various neurodevelopmental disorders but remain unstudied in DEEs. Rare epigenetic variations ("epivariants") can drive disease by modulating gene expression at single loci, whereas genome-wide DNA methylation changes can result in distinct "episignature" biomarkers for monogenic disorders in a growing number of rare diseases. Here, we interrogate the diagnostic utility of genome-wide DNA methylation array analysis on peripheral blood samples from 516 individuals with genetically unsolved DEEs who had previously undergone extensive genetic testing. We identified rare differentially methylated regions (DMRs) and explanatory episignatures to discover causative and candidate genetic etiologies in 10 individuals. We then used long-read sequencing to identify DNA variants underlying rare DMRs, including one balanced translocation, three CG-rich repeat expansions, and two copy number variants. We also identify pathogenic sequence variants associated with episignatures; some had been missed by previous exome sequencing. Although most DEE genes lack known episignatures, the increase in diagnostic yield for DNA methylation analysis in DEEs is comparable to the added yield of genome sequencing. Finally, we refine an episignature for CHD2 using an 850K methylation array which was further refined at higher CpG resolution using bisulfite sequencing to investigate potential insights into CHD2 pathophysiology. Our study demonstrates the diagnostic yield of genome-wide DNA methylation analysis to identify causal and candidate genetic causes as â¼2% (10/516) for unsolved DEE cases.
RESUMO
Elongator dysfunction is increasingly recognized as a contributor to multiple neurodevelopmental and neurodegenerative disorders including familial dysautonomia, intellectual disability, amyotrophic lateral sclerosis, and autism spectrum disorder. Although numerous cellular processes are perturbed in the context of Elongator loss, converging evidence from multiple studies has resolved Elongator's primary function in the cell to the modification of tRNA wobble uridines and the translational regulation of codon-biased genes. Here we characterize H2a.z, encoding the variant H2a histone H2A.Z, as an indirect Elongator target. We further show that canonical Notch signaling, a pathway directed by H2A.Z, is perturbed as a consequence of Elp1 loss. Finally, we demonstrate that hyperacetylation of H2A.Z and other histones via exposure to the histone deacetylase inhibitor Trichostatin A during neurogenesis corrects the expression of Notch3 and rescues the development of sensory neurons in embryos lacking the Elp1 Elongator subunit.
Assuntos
Histonas/metabolismo , Doenças Neurodegenerativas/genética , Receptores Notch/metabolismo , Transdução de Sinais/genética , Fatores de Elongação da Transcrição/genética , Humanos , Mutação com Perda de Função/genéticaRESUMO
Familial dysautonomia (FD) results from mutation in IKBKAP/ELP1, a gene encoding the scaffolding protein for the Elongator complex. This highly conserved complex is required for the translation of codon-biased genes in lower organisms. Here we investigate whether Elongator serves a similar function in mammalian peripheral neurons, the population devastated in FD. Using codon-biased eGFP sensors, and multiplexing of codon usage with transcriptome and proteome analyses of over 6,000 genes, we identify two categories of genes, as well as specific gene identities that depend on Elongator for normal expression. Moreover, we show that multiple genes in the DNA damage repair pathway are codon-biased, and that with Elongator loss, their misregulation is correlated with elevated levels of DNA damage. These findings link Elongator's function in the translation of codon-biased genes with both the developmental and neurodegenerative phenotypes of FD, and also clarify the increased risk of cancer associated with the disease.
Assuntos
Códon/genética , Disautonomia Familiar/metabolismo , Neurônios/metabolismo , Elongação Traducional da Cadeia Peptídica , Nervos Periféricos/metabolismo , Proteínas/metabolismo , Animais , Proteínas de Transporte/genética , Proteínas de Transporte/metabolismo , Células Cultivadas , Códon/metabolismo , Disautonomia Familiar/genética , Humanos , Peptídeos e Proteínas de Sinalização Intracelular , Camundongos , Camundongos Knockout , Neurônios/citologia , Nervos Periféricos/citologia , Proteínas/genéticaRESUMO
The opportunistic fungal pathogen Candida albicans can grow as yeast, pseudohyphae or true hyphae. C. albicans can switch between these morphologies in response to various environmental stimuli and this ability to switch is thought to be an important virulence trait. In Saccharomyces cerevisiae, the Grr1 protein is the substrate recognition component of an SCF ubiquitin ligase that regulates cell cycle progression, cell polarity and nutrient signaling. In this study, we have characterized the GRR1 gene of C. albicans. Deletion of GRR1 from the C. albicans genome results in a highly filamentous, pseudohyphal morphology under conditions that normally promote the yeast form of growth. Under hypha-inducing conditions, most cells lacking GRR1 retain a pseudohyphal morphology, but some cells appear to switch to hyphal-like growth and express the hypha-specific genes HWP1 and ECE1. The C. albicans GRR1 gene also complements the elongated cell morphology phenotype of an S. cerevisiae grr1Delta mutant, indicating that C. albicans GRR1 encodes a true orthologue of S. cerevisaie Grr1. These results support the hypothesis that the Grr1 protein of C. albicans, presumably as the F-box subunit of an SCF ubiquitin ligase, has an essential role in preventing the switch from the yeast cell morphology to a pseudohyphal morphology.