RESUMEN
Despite the growing number of genome-wide association studies (GWASs), it remains unclear to what extent gene-by-gene and gene-by-environment interactions influence complex traits in humans. The magnitude of genetic interactions in complex traits has been difficult to quantify because GWASs are generally underpowered to detect individual interactions of small effect. Here, we develop a method to test for genetic interactions that aggregates information across all trait-associated loci. Specifically, we test whether SNPs in regions of European ancestry shared between European American and admixed African American individuals have the same causal effect sizes. We hypothesize that in African Americans, the presence of genetic interactions will drive the causal effect sizes of SNPs in regions of European ancestry to be more similar to those of SNPs in regions of African ancestry. We apply our method to two traits: gene expression in 296 African Americans and 482 European Americans in the Multi-Ethnic Study of Atherosclerosis (MESA) and low-density lipoprotein cholesterol (LDL-C) in 74K African Americans and 296K European Americans in the Million Veteran Program (MVP). We find significant evidence for genetic interactions in our analysis of gene expression; for LDL-C, we observe a similar point estimate, although this is not significant, most likely due to lower statistical power. These results suggest that gene-by-gene or gene-by-environment interactions modify the effect sizes of causal variants in human complex traits.
Asunto(s)
Estudio de Asociación del Genoma Completo , Herencia Multifactorial , LDL-Colesterol , Expresión Génica , Humanos , Herencia Multifactorial/genética , Polimorfismo de Nucleótido Simple/genética , Población Blanca/genéticaRESUMEN
While polygenic risk scores (PRSs) enable early identification of genetic risk for chronic obstructive pulmonary disease (COPD), predictive performance is limited when the discovery and target populations are not well matched. Hypothesizing that the biological mechanisms of disease are shared across ancestry groups, we introduce a PrediXcan-derived polygenic transcriptome risk score (PTRS) to improve cross-ethnic portability of risk prediction. We constructed the PTRS using summary statistics from application of PrediXcan on large-scale GWASs of lung function (forced expiratory volume in 1 s [FEV1] and its ratio to forced vital capacity [FEV1/FVC]) in the UK Biobank. We examined prediction performance and cross-ethnic portability of PTRS through smoking-stratified analyses both on 29,381 multi-ethnic participants from TOPMed population/family-based cohorts and on 11,771 multi-ethnic participants from TOPMed COPD-enriched studies. Analyses were carried out for two dichotomous COPD traits (moderate-to-severe and severe COPD) and two quantitative lung function traits (FEV1 and FEV1/FVC). While the proposed PTRS showed weaker associations with disease than PRS for European ancestry, the PTRS showed stronger association with COPD than PRS for African Americans (e.g., odds ratio [OR] = 1.24 [95% confidence interval [CI]: 1.08-1.43] for PTRS versus 1.10 [0.96-1.26] for PRS among heavy smokers with ≥ 40 pack-years of smoking) for moderate-to-severe COPD. Cross-ethnic portability of the PTRS was significantly higher than the PRS (paired t test p < 2.2 × 10-16 with portability gains ranging from 5% to 28%) for both dichotomous COPD traits and across all smoking strata. Our study demonstrates the value of PTRS for improved cross-ethnic portability compared to PRS in predicting COPD risk.
Asunto(s)
Enfermedad Pulmonar Obstructiva Crónica , Transcriptoma , Humanos , Pulmón , National Heart, Lung, and Blood Institute (U.S.) , Enfermedad Pulmonar Obstructiva Crónica/genética , Factores de Riesgo , Estados Unidos/epidemiologíaRESUMEN
Platelets play a key role in thrombosis and hemostasis. Platelet count (PLT) and mean platelet volume (MPV) are highly heritable quantitative traits, with hundreds of genetic signals previously identified, mostly in European ancestry populations. We here utilize whole genome sequencing (WGS) from NHLBI's Trans-Omics for Precision Medicine initiative (TOPMed) in a large multi-ethnic sample to further explore common and rare variation contributing to PLT (n = 61 200) and MPV (n = 23 485). We identified and replicated secondary signals at MPL (rs532784633) and PECAM1 (rs73345162), both more common in African ancestry populations. We also observed rare variation in Mendelian platelet-related disorder genes influencing variation in platelet traits in TOPMed cohorts (not enriched for blood disorders). For example, association of GP9 with lower PLT and higher MPV was partly driven by a pathogenic Bernard-Soulier syndrome variant (rs5030764, p.Asn61Ser), and the signals at TUBB1 and CD36 were partly driven by loss of function variants not annotated as pathogenic in ClinVar (rs199948010 and rs571975065). However, residual signal remained for these gene-based signals after adjusting for lead variants, suggesting that additional variants in Mendelian genes with impacts in general population cohorts remain to be identified. Gene-based signals were also identified at several genome-wide association study identified loci for genes not annotated for Mendelian platelet disorders (PTPRH, TET2, CHEK2), with somatic variation driving the result at TET2. These results highlight the value of WGS in populations of diverse genetic ancestry to identify novel regulatory and coding signals, even for well-studied traits like platelet traits.
Asunto(s)
Estudio de Asociación del Genoma Completo , Medicina de Precisión , Plaquetas , Humanos , National Heart, Lung, and Blood Institute (U.S.) , Fenotipo , Polimorfismo de Nucleótido Simple , Medicina de Precisión/métodos , Estados UnidosRESUMEN
SLC37A4 encodes an endoplasmic reticulum (ER)-localized multitransmembrane protein required for transporting glucose-6-phosphate (Glc-6P) into the ER. Once transported into the ER, Glc-6P is subsequently hydrolyzed by tissue-specific phosphatases to glucose and inorganic phosphate during times of glucose depletion. Pathogenic variants in SLC37A4 cause an established recessive disorder known as glycogen storage disorder 1b characterized by liver and kidney dysfunction with neutropenia. We report seven individuals who presented with liver dysfunction multifactorial coagulation deficiency and cardiac issues and were heterozygous for the same variant, c.1267C>T (p.Arg423∗), in SLC37A4; the affected individuals were from four unrelated families. Serum samples from affected individuals showed profound accumulation of both high mannose and hybrid type N-glycans, while N-glycans in fibroblasts and undifferentiated iPSC were normal. Due to the liver-specific nature of this disorder, we generated a CRISPR base-edited hepatoma cell line harboring the c.1267C>T (p.Arg423∗) variant. These cells replicated the secreted abnormalities seen in serum N-glycosylation, and a portion of the mutant protein appears to relocate to a distinct, non-Golgi compartment, possibly ER exit sites. These cells also show a gene dosage-dependent alteration in the Golgi morphology and reduced intraluminal pH that may account for the altered glycosylation. In summary, we identify a recurrent mutation in SLC37A4 that causes a dominantly inherited congenital disorder of glycosylation characterized by coagulopathy and liver dysfunction with abnormal serum N-glycans.
Asunto(s)
Antiportadores/genética , Trastornos Congénitos de Glicosilación/etiología , Retículo Endoplásmico/patología , Hepatopatías/complicaciones , Proteínas de Transporte de Monosacáridos/genética , Mutación , Adulto , Niño , Preescolar , Trastornos Congénitos de Glicosilación/patología , Retículo Endoplásmico/genética , Retículo Endoplásmico/metabolismo , Femenino , Fibroblastos/metabolismo , Fibroblastos/patología , Genes Dominantes , Glicosilación , Humanos , Lactante , Recién Nacido , Masculino , LinajeRESUMEN
Despite widespread clinical genetic testing, many individuals with suspected genetic conditions lack a precise diagnosis, limiting their opportunity to take advantage of state-of-the-art treatments. In some cases, testing reveals difficult-to-evaluate structural differences, candidate variants that do not fully explain the phenotype, single pathogenic variants in recessive disorders, or no variants in genes of interest. Thus, there is a need for better tools to identify a precise genetic diagnosis in individuals when conventional testing approaches have been exhausted. We performed targeted long-read sequencing (T-LRS) using adaptive sampling on the Oxford Nanopore platform on 40 individuals, 10 of whom lacked a complete molecular diagnosis. We computationally targeted up to 151 Mbp of sequence per individual and searched for pathogenic substitutions, structural variants, and methylation differences using a single data source. We detected all genomic aberrations-including single-nucleotide variants, copy number changes, repeat expansions, and methylation differences-identified by prior clinical testing. In 8/8 individuals with complex structural rearrangements, T-LRS enabled more precise resolution of the mutation, leading to changes in clinical management in one case. In ten individuals with suspected Mendelian conditions lacking a precise genetic diagnosis, T-LRS identified pathogenic or likely pathogenic variants in six and variants of uncertain significance in two others. T-LRS accurately identifies pathogenic structural variants, resolves complex rearrangements, and identifies Mendelian variants not detected by other technologies. T-LRS represents an efficient and cost-effective strategy to evaluate high-priority genes and regions or complex clinical testing results.
Asunto(s)
Aberraciones Cromosómicas , Análisis Citogenético/métodos , Enfermedades Genéticas Congénitas/diagnóstico , Enfermedades Genéticas Congénitas/genética , Predisposición Genética a la Enfermedad , Genoma Humano , Mutación , Variaciones en el Número de Copia de ADN , Femenino , Pruebas Genéticas , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Cariotipificación , Masculino , Análisis de Secuencia de ADNRESUMEN
Moyamoya disease, a cerebrovascular disease leading to strokes in children and young adults, is characterized by progressive occlusion of the distal internal carotid arteries and the formation of collateral vessels. Altered genes play a prominent role in the aetiology of moyamoya disease, but a causative gene is not identified in the majority of cases. Exome sequencing data from 151 individuals from 84 unsolved families were analysed to identify further genes for moyamoya disease, then candidate genes assessed in additional cases (150 probands). Two families had the same rare variant in ANO1, which encodes a calcium-activated chloride channel, anoctamin-1. Haplotype analyses found the families were related, and ANO1 p.Met658Val segregated with moyamoya disease in the family with an LOD score of 3.3. Six additional ANO1 rare variants were identified in moyamoya disease families. The ANO1 rare variants were assessed using patch-clamp recordings, and the majority of variants, including ANO1 p.Met658Val, displayed increased sensitivity to intracellular Ca2+. Patients harbouring these gain-of-function ANO1 variants had classic features of moyamoya disease, but also had aneurysm, stenosis and/or occlusion in the posterior circulation. Our studies support that ANO1 gain-of-function pathogenic variants predispose to moyamoya disease and are associated with unique involvement of the posterior circulation.
Asunto(s)
Anoctamina-1 , Enfermedad de Moyamoya , Niño , Humanos , Adulto Joven , Anoctamina-1/genética , Canales de Cloruro/genética , Enfermedad de Moyamoya/genética , Proteínas de Neoplasias/genéticaRESUMEN
Rationale and Objectives: Up to 20% of idiopathic interstitial lung disease is familial, referred to as familial pulmonary fibrosis (FPF). An integrated analysis of FPF genetic risk was performed by comprehensively evaluating for genetic rare variants (RVs) in a large cohort of FPF kindreds. Methods: Whole-exome sequencing and/or candidate gene sequencing from affected individuals in 569 FPF kindreds was performed, followed by cosegregation analysis in large kindreds, gene burden analysis, gene-based risk scoring, cell-type enrichment analysis, and coexpression network construction. Measurements and Main Results: It was found that 14.9-23.4% of genetic risk in kindreds could be explained by RVs in genes previously linked to FPF, predominantly telomere-related genes. New candidate genes were identified in a small number of families-including SYDE1, SERPINB8, GPR87, and NETO1-and tools were developed for evaluation and prioritization of RV-containing genes across kindreds. Several pathways were enriched for RV-containing genes in FPF, including focal adhesion and mitochondrial complex I assembly. By combining single-cell transcriptomics with prioritized candidate genes, expression of RV-containing genes was discovered to be enriched in smooth muscle cells, type II alveolar epithelial cells, and endothelial cells. Conclusions: In the most comprehensive FPF genetic study to date, the prevalence of RVs in known FPF-related genes was defined, and new candidate genes and pathways relevant to FPF were identified. However, new RV-containing genes shared across multiple kindreds were not identified, thereby suggesting that heterogeneous genetic variants involving a variety of genes and pathways mediate genetic risk in most FPF kindreds.
Asunto(s)
Enfermedades Pulmonares Intersticiales , Fibrosis Pulmonar , Humanos , Fibrosis Pulmonar/genética , Células Endoteliales , Enfermedades Pulmonares Intersticiales/genética , Factores de Riesgo , Telómero , Predisposición Genética a la Enfermedad/genética , Receptores del Ácido Lisofosfatídico/genéticaRESUMEN
We identified ten persons in six consanguineous families with distal arthrogryposis (DA) who had congenital contractures, scoliosis, and short stature. Exome sequencing revealed that each affected person was homozygous for one of two different rare variants (c.470G>T [p.Cys157Phe] or c.469T>C [p.Cys157Arg]) affecting the same residue of myosin light chain, phosphorylatable, fast skeletal muscle (MYLPF). In a seventh family, a c.487G>A (p.Gly163Ser) variant in MYLPF arose de novo in a father, who transmitted it to his son. In an eighth family comprised of seven individuals with dominantly inherited DA, a c.98C>T (p.Ala33Val) variant segregated in all four persons tested. Variants in MYLPF underlie both dominant and recessively inherited DA. Mylpf protein models suggest that the residues associated with dominant DA interact with myosin whereas the residues altered in families with recessive DA only indirectly impair this interaction. Pathological and histological exam of a foot amputated from an affected child revealed complete absence of skeletal muscle (i.e., segmental amyoplasia). To investigate the mechanism for this finding, we generated an animal model for partial MYLPF impairment by knocking out zebrafish mylpfa. The mylpfa mutant had reduced trunk contractile force and complete pectoral fin paralysis, demonstrating that mylpf impairment most severely affects limb movement. mylpfa mutant muscle weakness was most pronounced in an appendicular muscle and was explained by reduced myosin activity and fiber degeneration. Collectively, our findings demonstrate that partial loss of MYLPF function can lead to congenital contractures, likely as a result of degeneration of skeletal muscle in the distal limb.
Asunto(s)
Artrogriposis/genética , Músculo Esquelético/patología , Anomalías Musculoesqueléticas/genética , Mutación/genética , Cadenas Ligeras de Miosina/genética , Adolescente , Secuencia de Aminoácidos , Animales , Niño , Contractura/genética , Extremidades/patología , Femenino , Humanos , Masculino , Miosinas/genética , Linaje , Adulto Joven , Pez Cebra/genéticaRESUMEN
Genetic testing has increased the number of variants identified in disease genes, but the diagnostic utility is limited by lack of understanding variant function. CARD11 encodes an adaptor protein that expresses dominant-negative and gain-of-function variants associated with distinct immunodeficiencies. Here, we used a "cloning-free" saturation genome editing approach in a diploid cell line to simultaneously score 2,542 variants for decreased or increased function in the region of CARD11 associated with immunodeficiency. We also described an exon-skipping mechanism for CARD11 dominant-negative activity. The classification of reported clinical variants was sensitive (94.6%) and specific (88.9%), which rendered the data immediately useful for interpretation of seven coding and splicing variants implicated in immunodeficiency found in our clinic. This approach is generalizable for variant interpretation in many other clinically actionable genes, in any relevant cell type.
Asunto(s)
Proteínas Adaptadoras de Señalización CARD/genética , Variación Genética , Guanilato Ciclasa/genética , Síndromes de Inmunodeficiencia/genética , Adenina/análogos & derivados , Adenina/farmacología , Proteína 10 de la LLC-Linfoma de Células B/genética , Linfocitos B/citología , Línea Celular , Diploidia , Exones , Genes Dominantes , Humanos , Células Jurkat , Linfoma/genética , Subunidad p50 de NF-kappa B/genética , Piperidinas/farmacología , Polimorfismo de Nucleótido Simple , Enfermedades de Inmunodeficiencia Primaria/genética , Sensibilidad y EspecificidadRESUMEN
Large datasets of hundreds to thousands of individuals measuring RNA-seq in observational studies are becoming available. Many popular software packages for analysis of RNA-seq data were constructed to study differences in expression signatures in an experimental design with well-defined conditions (exposures). In contrast, observational studies may have varying levels of confounding transcript-exposure associations; further, exposure measures may vary from discrete (exposed, yes/no) to continuous (levels of exposure), with non-normal distributions of exposure. We compare popular software for gene expression-DESeq2, edgeR and limma-as well as linear regression-based analyses for studying the association of continuous exposures with RNA-seq. We developed a computation pipeline that includes transformation, filtering and generation of empirical null distribution of association P-values, and we apply the pipeline to compute empirical P-values with multiple testing correction. We employ a resampling approach that allows for assessment of false positive detection across methods, power comparison and the computation of quantile empirical P-values. The results suggest that linear regression methods are substantially faster with better control of false detections than other methods, even with the resampling method to compute empirical P-values. We provide the proposed pipeline with fast algorithms in an R package Olivia, and implemented it to study the associations of measures of sleep disordered breathing with RNA-seq in peripheral blood mononuclear cells in participants from the Multi-Ethnic Study of Atherosclerosis.
Asunto(s)
Benchmarking/métodos , RNA-Seq , Análisis de Secuencia de ARN , Programas Informáticos , Algoritmos , Aterosclerosis/epidemiología , Aterosclerosis/etiología , Aterosclerosis/metabolismo , Simulación por Computador , Susceptibilidad a Enfermedades , Predisposición Genética a la Enfermedad , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Mutación , Fenotipo , Medición de Riesgo , Factores de Riesgo , Navegador WebRESUMEN
PURPOSE: To summarise the clinical, molecular and biochemical phenotype of mannosyl-oligosaccharide glucosidase-related congenital disorders of glycosylation (MOGS-CDG), which presents with variable clinical manifestations, and to analyse which clinical biochemical assay consistently supports diagnosis in individuals with bi-allelic variants in MOGS. METHODS: Phenotypic characterisation was performed through an international and multicentre collaboration. Genetic testing was done by exome sequencing and targeted arrays. Biochemical assays on serum and urine were performed to delineate the biochemical signature of MOGS-CDG. RESULTS: Clinical phenotyping revealed heterogeneity in MOGS-CDG, including neurological, immunological and skeletal phenotypes. Bi-allelic variants in MOGS were identified in 12 individuals from 11 families. The severity in each organ system was variable, without definite genotype correlation. Urine oligosaccharide analysis was consistently abnormal for all affected probands, whereas other biochemical analyses such as serum transferrin analysis was not consistently abnormal. CONCLUSION: The clinical phenotype of MOGS-CDG includes multisystemic involvement with variable severity. Molecular analysis, combined with biochemical testing, is important for diagnosis. In MOGS-CDG, urine oligosaccharide analysis via matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry can be used as a reliable biochemical test for screening and confirmation of disease.
RESUMEN
De novo mutations (DNMs), or mutations that appear in an individual despite not being seen in their parents, are an important source of genetic variation whose impact is relevant to studies of human evolution, genetics, and disease. Utilizing high-coverage whole-genome sequencing data as part of the Trans-Omics for Precision Medicine (TOPMed) Program, we called 93,325 single-nucleotide DNMs across 1,465 trios from an array of diverse human populations, and used them to directly estimate and analyze DNM counts, rates, and spectra. We find a significant positive correlation between local recombination rate and local DNM rate, and that DNM rate explains a substantial portion (8.98 to 34.92%, depending on the model) of the genome-wide variation in population-level genetic variation from 41K unrelated TOPMed samples. Genome-wide heterozygosity does correlate with DNM rate, but only explains <1% of variation. While we are underpowered to see small differences, we do not find significant differences in DNM rate between individuals of European, African, and Latino ancestry, nor across ancestrally distinct segments within admixed individuals. However, we did find significantly fewer DNMs in Amish individuals, even when compared with other Europeans, and even after accounting for parental age and sequencing center. Specifically, we found significant reductions in the number of CâA and TâC mutations in the Amish, which seem to underpin their overall reduction in DNMs. Finally, we calculated near-zero estimates of narrow sense heritability (h2), which suggest that variation in DNM rate is significantly shaped by nonadditive genetic effects and the environment.
Asunto(s)
Amish/genética , Genoma Humano , Adulto , Estudios de Cohortes , Análisis Mutacional de ADN , Femenino , Genética de Población , Heterocigoto , Humanos , Masculino , Mutación , Linaje , Secuenciación Completa del Genoma , Adulto JovenRESUMEN
BACKGROUND: The coronavirus disease 2019 (COVID-19) pandemic is dominated by variant viruses; the resulting impact on disease severity remains unclear. Using a retrospective cohort study, we assessed the hospitalization risk following infection with 7 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants. METHODS: Our study includes individuals with positive SARS-CoV-2 reverse transcription polymerase chain reaction (RT-PCR) in the Washington Disease Reporting System with available viral genome data, from 1 December 2020 to 14 January 2022. The analysis was restricted to cases with specimens collected through sentinel surveillance. Using a Cox proportional hazards model with mixed effects, we estimated hazard ratios (HR) for hospitalization risk following infection with a variant, adjusting for age, sex, calendar week, and vaccination. RESULTS: In total, 58 848 cases were sequenced through sentinel surveillance, of which 1705 (2.9%) were hospitalized due to COVID-19. Higher hospitalization risk was found for infections with Gamma (HR 3.20, 95% confidence interval [CI] 2.40-4.26), Beta (HR 2.85, 95% CI 1.56-5.23), Delta (HR 2.28 95% CI 1.56-3.34), or Alpha (HR 1.64, 95% CI 1.29-2.07) compared to infections with ancestral lineages; Omicron (HR 0.92, 95% CI .56-1.52) showed no significant difference in risk. Following Alpha, Gamma, or Delta infection, unvaccinated patients show higher hospitalization risk, while vaccinated patients show no significant difference in risk, both compared to unvaccinated, ancestral lineage cases. Hospitalization risk following Omicron infection is lower with vaccination. CONCLUSIONS: Infection with Alpha, Gamma, or Delta results in a higher hospitalization risk, with vaccination attenuating that risk. Our findings support hospital preparedness, vaccination, and genomic surveillance.
Asunto(s)
COVID-19 , SARS-CoV-2 , COVID-19/epidemiología , Hospitalización , Humanos , Estudios Retrospectivos , SARS-CoV-2/genética , Washingtón/epidemiologíaRESUMEN
BACKGROUND: Irinotecan (CPT-11) is an anticancer agent widely used to treat adult solid tumours. Large interindividual variability in the clearance of irinotecan and SN-38, its active and toxic metabolite, results in highly unpredictable toxicity. METHODS: In 217 cancer patients treated with intravenous irinotecan single agent or in combination, germline DNA was used to interrogate the variation in 84 genes by next-generation sequencing. A stepwise analytical framework including a population pharmacokinetic model with SNP- and gene-based testing was used to identify demographic/clinical/genetic factors that influence the clearance of irinotecan and SN-38. RESULTS: Irinotecan clearance was influenced by rs4149057 in SLCO1B1, body surface area, and co-administration of 5-fluorouracil/leucovorin/bevacizumab. SN-38 clearance was influenced by rs887829 in UGT1A1, pre-treatment total bilirubin, and EGFR rare variant burden. Within each UGT1A1 genotype group, elevated pre-treatment total bilirubin and/or presence of at least one rare variant in EGFR resulted in significantly lower SN-38 clearance. The model reduced the interindividual variability in irinotecan clearance from 38 to 34% and SN-38 clearance from 49 to 32%. CONCLUSIONS: This new model significantly reduced the interindividual variability in the clearance of irinotecan and SN-38. New genetic factors of variability in clearance have been identified.
Asunto(s)
Protocolos de Quimioterapia Combinada Antineoplásica/farmacocinética , Glucuronosiltransferasa/genética , Irinotecán/farmacocinética , Neoplasias/genética , Análisis de Secuencia de ADN/métodos , Administración Intravenosa , Adulto , Anciano , Anciano de 80 o más Años , Protocolos de Quimioterapia Combinada Antineoplásica/efectos adversos , Ensayos Clínicos como Asunto , Receptores ErbB/genética , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Irinotecán/efectos adversos , Transportador 1 de Anión Orgánico Específico del Hígado , Masculino , Persona de Mediana Edad , Neoplasias/tratamiento farmacológico , Variantes Farmacogenómicas , Polimorfismo de Nucleótido SimpleRESUMEN
Gene discovery for Mendelian conditions (MCs) offers a direct path to understanding genome function. Approaches based on next-generation sequencing applied at scale have dramatically accelerated gene discovery and transformed genetic medicine. Finding the genetic basis of â¼6,000-13,000 MCs yet to be delineated will require both technical and computational innovation, but will rely to a larger extent on meaningful data sharing.
Asunto(s)
Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , HumanosRESUMEN
Baratela-Scott syndrome (BSS) is a rare, autosomal-recessive disorder characterized by short stature, facial dysmorphisms, developmental delay, and skeletal dysplasia caused by pathogenic variants in XYLT1. We report clinical and molecular investigation of 10 families (12 individuals) with BSS. Standard sequencing methods identified biallelic pathogenic variants in XYLT1 in only two families. Of the remaining cohort, two probands had no variants and six probands had only a single variant, including four with a heterozygous 3.1 Mb 16p13 deletion encompassing XYLT1 and two with a heterozygous truncating variant. Bisulfite sequencing revealed aberrant hypermethylation in exon 1 of XYLT1, always in trans with the sequence variant or deletion when present; both alleles were methylated in those with no identified variant. Expression of the methylated XYLT1 allele was severely reduced in fibroblasts from two probands. Southern blot studies combined with repeat expansion analysis of genome sequence data showed that the hypermethylation is associated with expansion of a GGC repeat in the XYLT1 promoter region that is not present in the reference genome, confirming that BSS is a trinucleotide repeat expansion disorder. The hypermethylated allele accounts for 50% of disease alleles in our cohort and is not present in 130 control subjects. Our study highlights the importance of investigating non-sequence-based alterations, including epigenetic changes, to identify the missing heritability in genetic disorders.
Asunto(s)
Anomalías Múltiples/genética , Metilación de ADN/genética , Epigénesis Genética/genética , Exones/genética , Mutación , Pentosiltransferasa/genética , Expansión de Repetición de Trinucleótido/genética , Alelos , Southern Blotting , Estudios de Cohortes , Femenino , Humanos , Lactante , Recién Nacido , Masculino , Linaje , Sulfitos/metabolismo , Síndrome , UDP Xilosa Proteína XilosiltransferasaRESUMEN
Cerebellar malformations are diverse congenital anomalies frequently associated with developmental disability. Although genetic and prenatal non-genetic causes have been described, no systematic analysis has been performed. Here, we present a large-exome sequencing study of Dandy-Walker malformation (DWM) and cerebellar hypoplasia (CBLH). We performed exome sequencing in 282 individuals from 100 families with DWM or CBLH, and we established a molecular diagnosis in 36 of 100 families, with a significantly higher yield for CBLH (51%) than for DWM (16%). The 41 variants impact 27 neurodevelopmental-disorder-associated genes, thus demonstrating that CBLH and DWM are often features of monogenic neurodevelopmental disorders. Though only seven monogenic causes (19%) were identified in more than one individual, neuroimaging review of 131 additional individuals confirmed cerebellar abnormalities in 23 of 27 genetic disorders (85%). Prenatal risk factors were frequently found among individuals without a genetic diagnosis (30 of 64 individuals [47%]). Single-cell RNA sequencing of prenatal human cerebellar tissue revealed gene enrichment in neuronal and vascular cell types; this suggests that defective vasculogenesis may disrupt cerebellar development. Further, de novo gain-of-function variants in PDGFRB, a tyrosine kinase receptor essential for vascular progenitor signaling, were associated with CBLH, and this discovery links genetic and non-genetic etiologies. Our results suggest that genetic defects impact specific cerebellar cell types and implicate abnormal vascular development as a mechanism for cerebellar malformations. We also confirmed a major contribution for non-genetic prenatal factors in individuals with cerebellar abnormalities, substantially influencing diagnostic evaluation and counseling regarding recurrence risk and prognosis.
Asunto(s)
Cerebelo/anomalías , Cerebelo/diagnóstico por imagen , Estudios de Cohortes , Femenino , Humanos , Masculino , EmbarazoRESUMEN
PURPOSE: Mendelian disease genomic research has undergone a massive transformation over the past decade. With increasing availability of exome and genome sequencing, the role of Mendelian research has expanded beyond data collection, sequencing, and analysis to worldwide data sharing and collaboration. METHODS: Over the past 10 years, the National Institutes of Health-supported Centers for Mendelian Genomics (CMGs) have played a major role in this research and clinical evolution. RESULTS: We highlight the cumulative gene discoveries facilitated by the program, biomedical research leveraged by the approach, and the larger impact on the research community. Beyond generating a list of gene-phenotype relationships and participating in widespread data sharing, the CMGs have created resources, tools, and training for the larger community to foster understanding of genes and genome variation. The CMGs have participated in a wide range of data sharing activities, including deposition of all eligible CMG data into the Analysis, Visualization, and Informatics Lab-space (AnVIL), sharing candidate genes through the Matchmaker Exchange and the CMG website, and sharing variants in Genotypes to Mendelian Phenotypes (Geno2MP) and VariantMatcher. CONCLUSION: The work is far from complete; strengthening communication between research and clinical realms, continued development and sharing of knowledge and tools, and improving access to richly characterized data sets are all required to diagnose the remaining molecularly undiagnosed patients.
Asunto(s)
Exoma , Genómica , Estudios de Asociación Genética , Humanos , Fenotipo , Secuenciación del ExomaRESUMEN
Anophthalmia and microphthalmia (A/M) are rare birth defects affecting up to 2 per 10,000 live births. These conditions are manifested by the absence of an eye or reduced eye volumes within the orbit leading to vision loss. Although clinical case series suggest a strong genetic component in A/M, few systematic investigations have been conducted on potential genetic contributions owing to low population prevalence. To overcome this challenge, we utilized DNA samples and data collected as part of the National Birth Defects Prevention Study (NBDPS). The NBDPS employed multi-center ascertainment of infants affected by A/M. We performed exome sequencing on 67 family trios and identified numerous genes affected by rare deleterious nonsense and missense variants in this cohort, including de novo variants. We identified 9 nonsense changes and 86 missense variants that are absent from the reference human population (Genome Aggregation Database), and we suggest that these are high priority candidate genes for A/M. We also performed literature curation, single cell transcriptome comparisons, and molecular pathway analysis on the candidate genes and performed protein structure modeling to determine the potential pathogenic variant consequences on PAX6 in this disease.
Asunto(s)
Anoftalmos , Microftalmía , Anoftalmos/epidemiología , Exoma/genética , Humanos , Lactante , Microftalmía/epidemiología , Microftalmía/genética , Mutación Missense/genética , Secuenciación del ExomaRESUMEN
BACKGROUND: Otitis media (OM) susceptibility has significant heritability; however, the role of rare variants in OM is mostly unknown. Our goal is to identify novel rare variants that confer OM susceptibility. METHODS: We performed exome and Sanger sequencing of >1000 DNA samples from 551 multiethnic families with OM and unrelated individuals, RNA-sequencing and microbiome sequencing and analyses of swabs from the outer ear, middle ear, nasopharynx and oral cavity. We also examined protein localisation and gene expression in infected and healthy middle ear tissues. RESULTS: A large, intermarried pedigree that includes 81 OM-affected and 53 unaffected individuals cosegregates two known rare A2ML1 variants, a common FUT2 variant and a rare, novel pathogenic variant c.1682A>G (p.Glu561Gly) within SPINK5 (LOD=4.09). Carriage of the SPINK5 missense variant resulted in increased relative abundance of Microbacteriaceae in the middle ear, along with occurrence of Microbacteriaceae in the outer ear and oral cavity but not the nasopharynx. Eight additional novel SPINK5 variants were identified in 12 families and individuals with OM. A role for SPINK5 in OM susceptibility is further supported by lower RNA counts in variant carriers, strong SPINK5 localisation in outer ear skin, faint localisation to middle ear mucosa and eardrum and increased SPINK5 expression in human cholesteatoma. CONCLUSION: SPINK5 variants confer susceptibility to non-syndromic OM. These variants potentially contribute to middle ear pathology through breakdown of mucosal and epithelial barriers, immunodeficiency such as poor vaccination response, alteration of head and neck microbiota and facilitation of entry of opportunistic pathogens into the middle ear.