Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 53
Filter
1.
JAMA Psychiatry ; 2024 Apr 17.
Article in English | MEDLINE | ID: mdl-38630491

ABSTRACT

Importance: Autism spectrum disorder (ASD) is a neurodevelopmental disorder more prevalent in males than in females. The cause of ASD is largely genetic, but the association of genetics with the skewed sex ratio is not yet understood. To our knowledge, no large population-based study has provided estimates of heritability by sex. Objective: To estimate the sex-specific heritability of ASD. Design, Setting, and Participants: This was a population-based, retrospective analysis using national health registers of nontwin siblings and cousins from Sweden born between January 1, 1985, and December 31, 1998, with follow-up to 19 years of age. Data analysis occurred from August 2022 to November 2023. Main Outcomes and Measures: Models were fitted to estimate the relative variance in risk for ASD occurrence owing to sex-specific additive genetics, shared environmental effects, and a common residual term. The residual term conceptually captured other factors that promote individual behavioral variation (eg, maternal effects, de novo variants, rare genetic variants not additively inherited, or gene-environment interactions). Estimates were adjusted for differences in prevalence due to birth year and maternal and paternal age by sex. Results: The sample included 1 047 649 individuals in 456 832 families (538 283 males [51.38%]; 509 366 females [48.62%]). Within the entire sample, 12 226 (1.17%) received a diagnosis of ASD, comprising 8128 (1.51%) males and 4098 (0.80%) females. ASD heritability was estimated at 87.0% (95% CI, 81.4%-92.6%) for males and 75.7% (95% CI, 68.4%-83.1%) for females with a difference in heritability estimated at 11.3% (95% CI, 1.0%-21.6%). There was no support for shared environmental contributions. Conclusions and Relevance: These findings suggest that the degree of phenotypic variation attributable to genetic differences (heritability) differs between males and females, indicating that some of the underlying causes of the condition may differ between the 2 sexes. The skewed sex ratio in ASD may be partly explained by differences in genetic variance between the sexes.

2.
medRxiv ; 2024 Feb 04.
Article in English | MEDLINE | ID: mdl-38352539

ABSTRACT

Missense de novo variants (DNVs) and missense somatic variants contribute to neurodevelopmental disorders (NDDs) and cancer, respectively. Proteins with statistical enrichment based on analyses of these variants exhibit convergence in the differing NDD and cancer phenotypes. Herein, the question of why some of the same proteins are identified in both phenotypes is examined through investigation of clustering of missense variation at the protein level. Our hypothesis is that missense variation is present in different protein locations in the two phenotypes leading to the distinct phenotypic outcomes. We tested this hypothesis in 1D protein space using our software CLUMP. Furthermore, we newly developed 3D-CLUMP that uses 3D protein structures to spatially test clustering of missense variation for proteome-wide significance. We examined missense DNVs in 39,883 parent-child sequenced trios with NDDs and missense somatic variants from 10,543 sequenced tumors covering five TCGA cancer types and two COSMIC pan-cancer aggregates of tissue types. There were 57 proteins with proteome-wide significant missense variation clustering in NDDs when compared to cancers and 79 proteins with proteome-wide significant missense clustering in cancers compared to NDDs. While our main objective was to identify differences in patterns of missense variation, we also identified a novel NDD protein BLTP2. Overall, our study is innovative, provides new insights into differential missense variation in NDDs and cancer at the protein-level, and contributes necessary information toward building a framework for thinking about prognostic and therapeutic aspects of these proteins.

3.
Genome Biol Evol ; 16(2)2024 Feb 01.
Article in English | MEDLINE | ID: mdl-38302106

ABSTRACT

Regions under balancing selection are characterized by dense polymorphisms and multiple persistent haplotypes, along with other sequence complexities. Successful identification of these patterns depends on both the statistical approach and the quality of sequencing. To address this challenge, at first, a new statistical method called LD-ABF was developed, employing efficient Bayesian techniques to effectively test for balancing selection. LD-ABF demonstrated the most robust detection of selection in a variety of simulation scenarios, compared against a range of existing tests/tools (Tajima's D, HKA, Dng, BetaScan, and BalLerMix). Furthermore, the impact of the quality of sequencing on detection of balancing selection was explored, as well, using: (i) SNP genotyping and exome data, (ii) targeted high-resolution HLA genotyping (IHIW), and (iii) whole-genome long-read sequencing data (Pangenome). In the analysis of SNP genotyping and exome data, we identified known targets and 38 new selection signatures in genes not previously linked to balancing selection. To further investigate the impact of sequencing quality on detection of balancing selection, a detailed investigation of the MHC was performed with high-resolution HLA typing data. Higher quality sequencing revealed the HLA-DQ genes consistently demonstrated strong selection signatures otherwise not observed from the sparser SNP array and exome data. The HLA-DQ selection signature was also replicated in the Pangenome samples using considerably less samples but, with high-quality long-read sequence data. The improved statistical method, coupled with higher quality sequencing, leads to more consistent identification of selection and enhanced localization of variants under selection, particularly in complex regions.


Subject(s)
HLA-DQ Antigens , Polymorphism, Single Nucleotide , Gene Frequency , Linkage Disequilibrium , Bayes Theorem , Haplotypes , HLA-DQ Antigens/genetics
4.
Bioinformatics ; 40(1)2024 01 02.
Article in English | MEDLINE | ID: mdl-38175776

ABSTRACT

MOTIVATION: de novo variants (DNVs) are variants that are present in offspring but not in their parents. DNVs are both important for examining mutation rates as well as in the identification of disease-related variation. While efforts have been made to call DNVs, calling of DNVs is still challenging from parent-child sequenced trio data. We developed Hare And Tortoise (HAT) as an automated DNV detection workflow for highly accurate short-read and long-read sequencing data. Reliable detection of DNVs is important for human genomics and HAT addresses this need. RESULTS: HAT is a computational workflow that begins with aligned read data (i.e. CRAM or BAM) from a parent-child sequenced trio and outputs DNVs. HAT detects high-quality DNVs from Illumina short-read whole-exome sequencing, Illumina short-read whole-genome sequencing, and highly accurate PacBio HiFi long-read whole-genome sequencing data. The quality of these DNVs is high based on a series of quality metrics including number of DNVs per individual, percent of DNVs at CpG sites, and percent of DNVs phased to the paternal chromosome of origin. AVAILABILITY AND IMPLEMENTATION: https://github.com/TNTurnerLab/HAT.


Subject(s)
Hares , Turtles , Animals , Humans , Turtles/genetics , Hares/genetics , Exome , Genome, Human , Whole Genome Sequencing , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA
5.
medRxiv ; 2024 Jan 26.
Article in English | MEDLINE | ID: mdl-38293138

ABSTRACT

Neurodevelopmental proteasomopathies represent a distinctive category of neurodevelopmental disorders (NDD) characterized by genetic variations within the 26S proteasome, a protein complex governing eukaryotic cellular protein homeostasis. In our comprehensive study, we identified 23 unique variants in PSMC5 , which encodes the AAA-ATPase proteasome subunit PSMC5/Rpt6, causing syndromic NDD in 38 unrelated individuals. Overexpression of PSMC5 variants altered human hippocampal neuron morphology, while PSMC5 knockdown led to impaired reversal learning in flies and loss of excitatory synapses in rat hippocampal neurons. PSMC5 loss-of-function resulted in abnormal protein aggregation, profoundly impacting innate immune signaling, mitophagy rates, and lipid metabolism in affected individuals. Importantly, targeting key components of the integrated stress response, such as PKR and GCN2 kinases, ameliorated immune dysregulations in cells from affected individuals. These findings significantly advance our understanding of the molecular mechanisms underlying neurodevelopmental proteasomopathies, provide links to research in neurodegenerative diseases, and open up potential therapeutic avenues.

6.
Hum Mol Genet ; 32(24): 3342-3352, 2023 Dec 01.
Article in English | MEDLINE | ID: mdl-37712888

ABSTRACT

Single nucleotide variants in the general population are common genomic alterations, where the majority are presumed to be silent polymorphisms without known clinical significance. Using human induced pluripotent stem cell (hiPSC) cerebral organoid modeling of the 1.4 megabase Neurofibromatosis type 1 (NF1) deletion syndrome, we previously discovered that the cytokine receptor-like factor-3 (CRLF3) gene, which is co-deleted with the NF1 gene, functions as a major regulator of neuronal maturation. Moreover, children with NF1 and the CRLF3L389P variant have greater autism burden, suggesting that this gene might be important for neurologic function. To explore the functional consequences of this variant, we generated CRLF3L389P-mutant hiPSC lines and Crlf3L389P-mutant genetically engineered mice. While this variant does not impair protein expression, brain structure, or mouse behavior, CRLF3L389P-mutant human cerebral organoids and mouse brains exhibit impaired neuronal maturation and dendrite formation. In addition, Crlf3L389P-mutant mouse neurons have reduced dendrite lengths and branching, without any axonal deficits. Moreover, Crlf3L389P-mutant mouse hippocampal neurons have decreased firing rates and synaptic current amplitudes relative to wild type controls. Taken together, these findings establish the CRLF3L389P variant as functionally deleterious and suggest that it may be a neurodevelopmental disease modifier.


Subject(s)
Induced Pluripotent Stem Cells , Child , Humans , Animals , Mice , Induced Pluripotent Stem Cells/metabolism , Neurons/metabolism , Brain/metabolism , Receptors, Cytokine/metabolism , Nucleotides/metabolism
7.
BMC Bioinformatics ; 24(1): 330, 2023 Sep 02.
Article in English | MEDLINE | ID: mdl-37660114

ABSTRACT

BACKGROUND: The study of de novo variation is important for assessing biological characteristics of new variation and for studies related to human phenotypes. Software programs exist to call de novo variants and programs also exist to test the burden of these variants in genomic regions; however, I am unaware of a program that fits in between these two aspects of de novo variant assessment. This intermediate space is important for assessing the quality of de novo variants and to understand the characteristics of the callsets. For this reason, I developed an R package called acorn. RESULTS: Acorn is an R package that examines various features of de novo variants including subsetting the data by individual(s), variant type, or genomic region; calculating features including variant change counts, variant lengths, and presence/absence at CpG sites; and characteristics of parental age in relation to de novo variant counts. CONCLUSIONS: Acorn is an R package that fills a critical gap in assessing de novo variants and will be of benefit to many investigators studying de novo variation.


Subject(s)
Genomics , Software , Humans , Phenotype
8.
bioRxiv ; 2023 Jun 07.
Article in English | MEDLINE | ID: mdl-37333171

ABSTRACT

Recently, Pacific Biosciences released a new highly accurate long-read sequencer called the Revio System that is projected to generate 30× HiFi whole-genome sequencing for the human genome within one sequencing SMRT Cell. Mouse and human genomes are similar in size. In this study, we sought to test this new sequencer by characterizing the genome and epigenome of the mouse neuronal cell line Neuro-2a. We generated long-read HiFi whole-genome sequencing on three Revio SMRT Cells, achieving a total coverage of 98×, with 30×, 32×, and 36× coverage respectively for each of the three Revio SMRT Cells. We performed several tests on these data including single-nucleotide variant and small insertion detection using GPU-accelerated DeepVariant, structural variant detection with pbsv, methylation detection with pb-CpG-tools, and generating de novo assemblies with the HiCanu and hifiasm assemblers. Overall, we find consistency across SMRT Cells in coverage, detection of variation, methylation, and de novo assemblies for each of the three SMRT Cells.

9.
Ann Clin Transl Neurol ; 10(5): 787-801, 2023 05.
Article in English | MEDLINE | ID: mdl-37000947

ABSTRACT

OBJECTIVE: The goal of this study is to demonstrate the utility of a growth assay to quantify the functional impact of single nucleotide variants (SNVs) in SLC2A1, the gene responsible for Glut1DS. METHODS: The functional impact of 40 SNVs in SLC2A1 was quantitatively determined in HAP1 cells in which SLC2A1 is required for growth. Donor libraries were introduced into the endogenous SLC2A1 gene in HAP1-Lig4KO cells using CRISPR/Cas9. Cell populations were harvested and sequenced to quantify the effect of variants on growth and generate a functional score. Quantitative functional scores were compared to 3-OMG uptake, SLC2A1 cell surface expression, CADD score, and clinical data, including CSF/blood glucose ratio. RESULTS: Nonsense variants (N = 3) were reduced in cell culture over time resulting in negative scores (mean score: -1.15 ± 0.17), whereas synonymous variants (N = 10) were not depleted (mean score: 0.25 ± 0.12) (P < 2e-16). Missense variants (N = 27) yielded a range of functional scores including slightly negative scores, supporting a partial function and intermediate phenotype. Several variants with normal results on either cell surface expression (p.N34S and p.W65R) or 3-OMG uptake (p.W65R) had negative functional scores. There is a moderate but significant correlation between our functional scores and CADD scores. INTERPRETATION: Cell growth is useful to quantitatively determine the functional effects of SLC2A1 variants. Nonsense variants were reliably distinguished from benign variants in this in vitro functional assay. For facilitating early diagnosis and therapeutic intervention, future work is needed to determine the functional effect of every possible variant in SLC2A1.


Subject(s)
Carbohydrate Metabolism, Inborn Errors , Humans , Phenotype , Carbohydrate Metabolism, Inborn Errors/genetics , Carbohydrate Metabolism, Inborn Errors/diagnosis , Monosaccharide Transport Proteins/genetics , Mutation, Missense , Glucose Transporter Type 1/genetics
10.
bioRxiv ; 2023 Jan 28.
Article in English | MEDLINE | ID: mdl-36747667

ABSTRACT

Motivation: de novo variant (DNV) calling is challenging from parent-child sequenced trio data. We developed Hare And Tortoise (HAT) to work as an automated workflow to detect DNVs in highly accurate short-read and long-read sequencing data. Reliable detection of DNVs is important for human genetics studies (e.g., autism, epilepsy). Results: HAT is a workflow to detect DNVs from short-read and long read sequencing data. This workflow begins with aligned read data (i.e., CRAM or BAM) from a parent-child sequenced trio and outputs DNVs. HAT detects high-quality DNVs from short-read whole-exome sequencing, short-read whole-genome sequencing, and highly accurate long-read sequencing data.

11.
Plant Commun ; 4(2): 100493, 2023 03 13.
Article in English | MEDLINE | ID: mdl-36397679

ABSTRACT

Genomic assemblies of the unicellular green alga Chlamydomonas reinhardtii have provided important resources for researchers. However, assembly errors, large gaps, and unplaced scaffolds as well as strain-specific variants currently impede many types of analysis. By combining PacBio HiFi and Oxford Nanopore long-read technologies, we generated a de novo genome assembly for strain CC-5816, derived from crosses of strains CC-125 and CC-124. Multiple methods of evaluating genome completeness and base-pair error rate suggest that the final telomere-to-telomere assembly is highly accurate. The CC-5816 assembly enabled previously difficult analyses that include characterization of the 17 centromeres, rDNA arrays on three chromosomes, and 56 insertions of organellar DNA into the nuclear genome. Using Nanopore sequencing, we identified sites of cytosine (CpG) methylation, which are enriched at centromeres. We analyzed CRISPR-Cas9 insertional mutants in the PF23 gene. Two of the three alleles produced progeny that displayed patterns of meiotic inviability that suggested the presence of a chromosomal aberration. Mapping Nanopore reads from pf23-2 and pf23-3 onto the CC-5816 genome showed that these two strains each carry a translocation that was initiated at the PF23 gene locus on chromosome 11 and joined with chromosomes 5 or 3, respectively. The translocations were verified by demonstrating linkage between loci on the two translocated chromosomes in meiotic progeny. The three pf23 alleles display the expected short-cilia phenotype, and immunoblotting showed that pf23-2 lacks the PF23 protein. Our CC-5816 genome assembly will undoubtedly provide an important tool for the Chlamydomonas research community.


Subject(s)
Chlamydomonas reinhardtii , Chlamydomonas reinhardtii/genetics , High-Throughput Nucleotide Sequencing/methods , Mutagenesis
12.
Hum Mutat ; 43(12): 1979-1993, 2022 12.
Article in English | MEDLINE | ID: mdl-36054329

ABSTRACT

Detection of de novo variants (DNVs) is critical for studies of disease-related variation and mutation rates. To accelerate DNV calling, we developed a graphics processing units-based workflow. We applied our workflow to whole-genome sequencing data from three parent-child sequenced cohorts including the Simons Simplex Collection (SSC), Simons Foundation Powering Autism Research (SPARK), and the 1000 Genomes Project (1000G) that were sequenced using DNA from blood, saliva, and lymphoblastoid cell lines (LCLs), respectively. The SSC and SPARK DNV callsets were within expectations for number of DNVs, percent at CpG sites, phasing to the paternal chromosome of origin, and average allele balance. However, the 1000G DNV callset was not within expectations and contained excessive DNVs that are likely cell line artifacts. Mutation signature analysis revealed 30% of 1000G DNV signatures matched B-cell lymphoma. Furthermore, we found variants in DNA repair genes and at Clinvar pathogenic or likely-pathogenic sites and significant excess of protein-coding DNVs in IGLL5; a gene known to be involved in B-cell lymphomas. Our study provides a new rapid DNV caller for the field and elucidates important implications of using sequencing data from LCLs for reference building and disease-related projects.


Subject(s)
Neoplasms , Humans , Alleles , Mutation , Neoplasms/genetics , Whole Genome Sequencing
13.
Nat Genet ; 54(9): 1305-1319, 2022 09.
Article in English | MEDLINE | ID: mdl-35982159

ABSTRACT

To capture the full spectrum of genetic risk for autism, we performed a two-stage analysis of rare de novo and inherited coding variants in 42,607 autism cases, including 35,130 new cases recruited online by SPARK. We identified 60 genes with exome-wide significance (P < 2.5 × 10-6), including five new risk genes (NAV3, ITSN1, MARK2, SCAF1 and HNRNPUL2). The association of NAV3 with autism risk is primarily driven by rare inherited loss-of-function (LoF) variants, with an estimated relative risk of 4, consistent with moderate effect. Autistic individuals with LoF variants in the four moderate-risk genes (NAV3, ITSN1, SCAF1 and HNRNPUL2; n = 95) have less cognitive impairment than 129 autistic individuals with LoF variants in highly penetrant genes (CHD8, SCN2A, ADNP, FOXP1 and SHANK3) (59% vs 88%, P = 1.9 × 10-6). Power calculations suggest that much larger numbers of autism cases are needed to identify additional moderate-risk genes.


Subject(s)
Autism Spectrum Disorder , Autistic Disorder , Autism Spectrum Disorder/genetics , Autistic Disorder/genetics , Exome/genetics , Forkhead Transcription Factors/genetics , Genetic Predisposition to Disease , Humans , Mutation , Repressor Proteins/genetics , Exome Sequencing
14.
Neuron ; 110(20): 3243-3262, 2022 10 19.
Article in English | MEDLINE | ID: mdl-35868305

ABSTRACT

Male sex is a strong risk factor for autism spectrum disorder (ASD). The leading theory for a "female protective effect" (FPE) envisions males and females have "differing thresholds" under a "liability threshold model" (DT-LTM). Specifically, this model posits that females require either a greater number or larger magnitude of risk factors (i.e., greater liability) to manifest ASD, which is supported by the finding that a greater proportion of females with ASD have highly penetrant genetic mutations. Herein, we derive testable hypotheses from the DT-LTM for ASD, investigating heritability, familial recurrence, correlation between ASD penetrance and sex ratio, population traits, clinical features, the stability of the sex ratio across diagnostic changes, and highlight other key prerequisites. Our findings reveal that several key predictions of the DT-LTM are not supported by current data, requiring us to establish a different conceptual framework for evaluating alternate models that explain sex differences in ASD.


Subject(s)
Autism Spectrum Disorder , Female , Male , Humans , Autism Spectrum Disorder/diagnosis , Sex Characteristics , Phenotype , Penetrance
15.
Genetics ; 221(2)2022 05 31.
Article in English | MEDLINE | ID: mdl-35385101

ABSTRACT

Genomic regions subject to purifying selection are more likely to carry disease-causing mutations than regions not under selection. Cross species conservation is often used to identify such regions but with limited resolution to detect selection on short evolutionary timescales such as that occurring in only one species. In contrast, genetic intolerance looks for depletion of variation relative to expectation within a species, allowing species-specific features to be identified. When estimating the intolerance of noncoding sequence, methods strongly leverage variant frequency distributions. As the expected distributions depend on ancestry, if not properly controlled for, ancestral population source may obfuscate signals of selection. We demonstrate that properly incorporating ancestry in intolerance estimation greatly improved variant classification. We provide a genome-wide intolerance map that is conditional on ancestry and likely to be particularly valuable for variant prioritization.


Subject(s)
Genome, Human , Genomics , Biological Evolution , Genetics, Population , Humans , Selection, Genetic
16.
Am J Med Genet A ; 188(7): 2071-2081, 2022 07.
Article in English | MEDLINE | ID: mdl-35366058

ABSTRACT

Currently, protein-coding de novo variants and large copy number variants have been identified as important for ~30% of individuals with autism. One approach to identify relevant variation in individuals who lack these types of events is by utilizing newer genomic technologies. In this study, highly accurate PacBio HiFi long-read sequencing was applied to a family with autism, epileptic encephalopathy, cognitive impairment, and mild dysmorphic features (two affected female siblings, unaffected parents, and one unaffected male sibling) with no known clinical variant. From our long-read sequencing data, a de novo missense variant in the KCNC2 gene (encodes Kv3.2) was identified in both affected children. This variant was phased to the paternal chromosome of origin and is likely a germline mosaic. In silico assessment revealed the variant was not in controls, highly conserved, and predicted damaging. This specific missense variant (Val473Ala) has been shown in both an ortholog and paralog of Kv3.2 to accelerate current decay, shift the voltage dependence of activation, and prevent the channel from entering a long-lasting open state. Seven additional missense variants have been identified in other individuals with neurodevelopmental disorders (p = 1.03 × 10-5 ). KCNC2 is most highly expressed in the brain; in particular, in the thalamus and is enriched in GABAergic neurons. Long-read sequencing was useful in discovering the relevant variant in this family with autism that had remained a mystery for several years and will potentially have great benefits in the clinic once it is widely available.


Subject(s)
Autistic Disorder , Epilepsy , Shaw Potassium Channels , Autistic Disorder/genetics , Child , Epilepsy/genetics , Female , Germ Cells , Humans , Male , Mosaicism , Mutation, Missense , Shaw Potassium Channels/genetics
17.
HGG Adv ; 3(1): 100081, 2022 Jan 13.
Article in English | MEDLINE | ID: mdl-35047865

ABSTRACT

While 9p deletion and duplication syndromes have been studied for several years, small sample sizes and minimal high-resolution data have limited a comprehensive delineation of genotypic and phenotypic characteristics. In this study, we examined genetic data from 719 individuals in the worldwide 9p Network Cohort: a cohort seven to nine times larger than any previous study of 9p. Most breakpoints occur in bands 9p22 and 9p24, accounting for 35% and 38% of all breakpoints, respectively. Bands 9p11 and 9p12 have the fewest breakpoints, with each accounting for 0.6% of all breakpoints. The most common phenotype in 9p deletion and duplication syndromes is developmental delay, and we identified eight known neurodevelopmental disorder genes in 9p22 and 9p24. Since it has been previously reported that some individuals have a secondary structural variant related to the 9p variant, we examined our cohort for these variants and found 97 events. The top secondary variant involved 9q in 14 individuals (1.9%), including ring chromosomes and inversions. We identified a gender bias with significant enrichment for females (p = 0.0006) that may arise from a sex reversal in some individuals with 9p deletions. Genes on 9p were characterized regarding function, constraint metrics, and protein-protein interactions, resulting in a prioritized set of genes for further study. Finally, we achieved precision genomics in one child with a complex 9p structural variation using modern genomic technologies, demonstrating that long-read sequencing will be integral for some cases. Our study is the largest ever on 9p-related syndromes and provides key insights into genetic factors involved in these syndromes.

18.
Neurol Genet ; 8(4): e200003, 2022 Aug.
Article in English | MEDLINE | ID: mdl-37435433

ABSTRACT

Objectives: Variants in the neurofibromatosis type 1 (NF1) gene are not only responsible for the NF1 cancer predisposition syndrome, but also frequently identified in cancers arising in the general population. While germline variants are pathogenic, it is not known whether those that arise in cancer (somatic variants) are passenger or driver variants. To address this question, we sought to define the landscape of NF1 variants in sporadic cancers. Methods: NF1 variants in sporadic cancers were compiled using data curated on the c-Bio database and compared with published germline variants and Genome Aggregation Database data. Pathogenicity was determined using Polyphen and Sorting Intolerant From Tolerant prediction tools. Results: The spectrum of NF1 variants in sporadic tumors differ from those most commonly seen in individuals with NF1. In addition, the type and location of the variants in sporadic cancer differ from germline variants, where a high proportion of missense variants were found. Finally, many of the sporadic cancer NF1 variants were not predicted to be pathogenic. Discussion: Taken together, these findings suggest that a significant proportion of NF1 variants in sporadic cancer may be passenger variants or hypomorphic alleles. Further mechanistic studies are warranted to define their unique roles in nonsyndromic cancer pathobiology.

19.
Nature ; 598(7879): 205-213, 2021 10.
Article in English | MEDLINE | ID: mdl-34616060

ABSTRACT

During mammalian development, differences in chromatin state coincide with cellular differentiation and reflect changes in the gene regulatory landscape1. In the developing brain, cell fate specification and topographic identity are important for defining cell identity2 and confer selective vulnerabilities to neurodevelopmental disorders3. Here, to identify cell-type-specific chromatin accessibility patterns in the developing human brain, we used a single-cell assay for transposase accessibility by sequencing (scATAC-seq) in primary tissue samples from the human forebrain. We applied unbiased analyses to identify genomic loci that undergo extensive cell-type- and brain-region-specific changes in accessibility during neurogenesis, and an integrative analysis to predict cell-type-specific candidate regulatory elements. We found that cerebral organoids recapitulate most putative cell-type-specific enhancer accessibility patterns but lack many cell-type-specific open chromatin regions that are found in vivo. Systematic comparison of chromatin accessibility across brain regions revealed unexpected diversity among neural progenitor cells in the cerebral cortex and implicated retinoic acid signalling in the specification of neuronal lineage identity in the prefrontal cortex. Together, our results reveal the important contribution of chromatin state to the emerging patterns of cell type diversity and cell fate specification and provide a blueprint for evaluating the fidelity and robustness of cerebral organoids as a model for cortical development.


Subject(s)
Brain/cytology , Epigenomics , Neurogenesis , Single-Cell Analysis , Atlases as Topic , Brain/growth & development , Brain/metabolism , Chromatin/chemistry , Chromatin/genetics , Chromatin/metabolism , Disease Susceptibility , Enhancer Elements, Genetic , Humans , Neurons/cytology , Neurons/metabolism , Organoids/cytology , Tretinoin/metabolism
20.
Bioinformatics ; 37(21): 3920-3922, 2021 11 05.
Article in English | MEDLINE | ID: mdl-34601580

ABSTRACT

MOTIVATION: An abundance of new reference genomes is becoming available through large-scale sequencing efforts. While the reference FASTA for each genome is available, there is currently no automated mechanism to query a specific sequence across all new reference genomes. RESULTS: We developed ACES (Analysis of Conservation with an Extensive list of Species) as a computational workflow to query specific sequences of interest (e.g. enhancers, promoters, exons) against reference genomes with an available reference FASTA. This automated workflow generates BLAST hits against each of the reference genomes, a multiple sequence alignment file, a graphical fragment assembly file and a phylogenetic tree file. These data files can then be used by the researcher in several ways to provide key insights into conservation of the query sequence. AVAILABILITY AND IMPLEMENTATION: ACES is available at https://github.com/TNTurnerLab/ACES. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genome , Software , Phylogeny , Sequence Alignment , Exons
SELECTION OF CITATIONS
SEARCH DETAIL
...