RESUMEN
Using five complementary short- and long-read sequencing technologies, we phased and assembled >95% of each diploid human genome in a four-generation, 28-member family (CEPH 1463) allowing us to systematically assess de novo mutations (DNMs) and recombination. From this family, we estimate an average of 192 DNMs per generation, including 75.5 de novo single-nucleotide variants (SNVs), 7.4 non-tandem repeat indels, 79.6 de novo indels or structural variants (SVs) originating from tandem repeats, 7.7 centromeric de novo SVs and SNVs, and 12.4 de novo Y chromosome events per generation. STRs and VNTRs are the most mutable with 32 loci exhibiting recurrent mutation through the generations. We accurately assemble 288 centromeres and six Y chromosomes across the generations, documenting de novo SVs, and demonstrate that the DNM rate varies by an order of magnitude depending on repeat content, length, and sequence identity. We show a strong paternal bias (75-81%) for all forms of germline DNM, yet we estimate that 17% of de novo SNVs are postzygotic in origin with no paternal bias. We place all this variation in the context of a high-resolution recombination map (~3.5 kbp breakpoint resolution). We observe a strong maternal recombination bias (1.36 maternal:paternal ratio) with a consistent reduction in the number of crossovers with increasing paternal (r=0.85) and maternal (r=0.65) age. However, we observe no correlation between meiotic crossover locations and de novo SVs, arguing against non-allelic homologous recombination as a predominant mechanism. The use of multiple orthogonal technologies, near-telomere-to-telomere phased genome assemblies, and a multi-generation family to assess transmission has created the most comprehensive, publicly available "truth set" of all classes of genomic variants. The resource can be used to test and benchmark new algorithms and technologies to understand the most fundamental processes underlying human genetic variation.
RESUMEN
Juvenile idiopathic arthritis (JIA) is a complex rheumatic disease encompassing several clinically defined subtypes of varying severity. The etiology of JIA remains largely unknown, but genome-wide association studies (GWASs) have identified up to 22 genes associated with JIA susceptibility, including a well-established association with HLA-DRB1. Continued investigation of heritable risk factors has been hindered by disease heterogeneity and low disease prevalence. In this study, we utilized shared genomic segments (SGS) analysis on whole-genome sequencing of 40 cases from 12 multi-generational pedigrees significantly enriched for JIA. Subsets of cases are connected by a common ancestor in large extended pedigrees, increasing the power to identify disease-associated loci. SGS analysis identifies genomic segments shared among disease cases that are likely identical by descent and anchored by a disease locus. This approach revealed statistically significant signals for major histocompatibility complex (MHC) class I and class III alleles, particularly HLA-A∗02:01, which was observed at a high frequency among cases. Furthermore, we identified an additional risk locus at 12q23.2-23.3, containing genes primarily expressed by naive B cells, natural killer cells, and monocytes. The recognition of additional risk beyond HLA-DRB1 provides a new perspective on immune cell dynamics in JIA. These findings contribute to our understanding of JIA and may guide future research and therapeutic strategies.
Asunto(s)
Artritis Juvenil , Humanos , Artritis Juvenil/genética , Cadenas HLA-DRB1/genética , Estudio de Asociación del Genoma Completo , Predisposición Genética a la Enfermedad/genética , Antígenos HLA/genética , Antígenos de Histocompatibilidad Clase I/genética , Factores de Riesgo , GenómicaRESUMEN
Misfolded proteins in the endoplasmic reticulum (ER) elicit the ER stress response, a large transcriptional response driven by 3 well-characterized transcription factors (TFs). This transcriptional response is variable across different genetic backgrounds. One mechanism in which genetic variation can lead to transcriptional variability in the ER stress response is through altered binding and activity of the 3 main TFs: XBP1, ATF6, and ATF4. This work attempts to better understand this mechanism by first creating a computational pipeline to identify potential binding sites throughout the human genome. We utilized GTEx data sets to identify cis-eQTLs that fall within predicted TF binding sites (TFBSs). We also utilized the ClinVar database to compare the number of pathogenic vs benign variants at different positions of the binding motifs. Finally, we performed a cis-eQTL analysis on human cell lines experiencing ER stress to identify cis-eQTLs that regulate the variable ER stress response. The majority of these cis-eQTLs are unique to a given condition: control or ER stress. Some of these stress-specific cis-eQTLs fall within putative binding sites of the 3 main ER stress response TFs, providing a potential mechanism by which these cis-eQTLs might be impacting gene expression under ER stress conditions through altered TF binding. This study represents the first cis-eQTL analysis on human samples experiencing ER stress and is a vital step toward identifying the genetic components responsible for the variable ER stress response.
Asunto(s)
Sitios de Carácter Cuantitativo , Factores de Transcripción , Humanos , Factores de Transcripción/genética , Sitios de Unión , Unión Proteica , Variación Genética , Polimorfismo de Nucleótido SimpleRESUMEN
One to two percent of couples suffer recurrent pregnancy loss and over 50% of the cases are unexplained. Whole genome sequencing (WGS) analysis has the potential to identify previously unrecognized causes of pregnancy loss, but few studies have been performed, and none have included DNA from families including parents, losses, and live births. We conducted a pilot WGS study in three families with unexplained recurrent pregnancy loss, including parents, healthy live births, and losses, which included an embryonic loss (<10 weeks' gestation), fetal deaths (10-20 weeks' gestation) and stillbirths (≥ 20 weeks' gestation). We used the Illumina platform for WGS and state-of-the-art protocols to identify single nucleotide variants (SNVs) following various modes of inheritance. We identified 87 SNVs involving 75 genes in embryonic loss (n = 1), 370 SNVs involving 228 genes in fetal death (n = 3), and 122 SNVs involving 122 genes in stillbirth (n = 2). Of these, 22 de novo, 6 inherited autosomal dominant and an X-linked recessive SNVs were pathogenic (probability of being loss-of-function intolerant >0.9), impacting known genes (e.g., DICER1, FBN2, FLT4, HERC1, and TAOK1) involved in embryonic/fetal development and congenital abnormalities. Further, we identified inherited missense compound heterozygous SNVs impacting genes (e.g., VWA5B2) in two fetal death samples. The variants were not identified as compound heterozygous SNVs in live births and population controls, providing evidence for haplosufficient genes relevant to pregnancy loss. In this pilot study, we provide evidence for de novo and inherited SNVs relevant to pregnancy loss. Our findings provide justification for conducting WGS using larger numbers of families and warrant validation by targeted sequencing to ascertain causal variants. Elucidating genes causing pregnancy loss may facilitate the development of risk stratification strategies and novel therapeutics.
Asunto(s)
Aborto Habitual , Embarazo , Femenino , Humanos , Proyectos Piloto , Aborto Habitual/genética , Mortinato/genética , Mortinato/epidemiología , Nacimiento Vivo , Proteínas Serina-Treonina Quinasas , Ribonucleasa III , ARN Helicasas DEAD-boxRESUMEN
OBJECTIVE: To determine whether stillbirth aggregates in families and quantify its familial risk using extended pedigrees. DESIGN: State-wide matched case-control study. SETTING: Utah, United States. POPULATION: Stillbirth cases (n = 9404) and live birth controls (18 808) between 1978 and 2019. METHODS: Using the Utah Population Database, a population-based genealogical resource linked with state fetal death and birth records, we identified high-risk pedigrees with excess familial aggregation of stillbirth using the Familial Standardised Incidence Ratio (FSIR). Stillbirth odds ratio (OR) for first-degree relatives (FDR), second-degree relatives (SDR) and third-degree relatives (TDR) of parents with a stillbirth (affected) and live birth (unaffected) were estimated using logistic regression models. MAIN OUTCOME MEASURES: Familial aggregation estimated using FSIR, and stillbirth OR estimated for FDR, SDR and TDR of affected and unaffected parents using logistic regression models. RESULTS: We identified 390 high-risk pedigrees with evidence for excess familial aggregation (FSIR ≥2.00; P-value <0.05). FDRs, SDRs and TDRs of affected parents had 1.14-fold (95% confidence interval [CI]: 1.04-1.26), 1.22-fold (95% CI 1.11-1.33) and 1.15-fold (95% CI 1.08-1.21) higher stillbirth odds compared with FDRs, SDRs and TDRs of unaffected parents, respectively. Parental sex-specific analyses showed male FDRs, SDRs and TDRs of affected fathers had 1.22-fold (95% CI 1.02-1.47), 1.38-fold (95% CI 1.17-1.62) and 1.17-fold (95% CI 1.05-1.30) higher stillbirth odds compared with those of unaffected fathers, respectively. FDRs, SDRs and TDRs of affected mothers had 1.12-fold (95% CI 0.98-1.28), 1.09-fold (95% CI 0.96-1.24) and 1.15-fold (95% CI 1.06-1.24) higher stillbirth odds compared with those of unaffected mothers, respectively. CONCLUSIONS: We provide evidence for familial aggregation of stillbirth. Our findings warrant investigation into genes associated with stillbirth and underscore the need to design large-scale studies to determine the genetic architecture of stillbirth.
Asunto(s)
Madres , Mortinato , Femenino , Embarazo , Humanos , Masculino , Estudios de Casos y Controles , Mortinato/epidemiología , Mortinato/genética , Linaje , Incidencia , Utah/epidemiología , Predisposición Genética a la Enfermedad , Factores de RiesgoRESUMEN
BACKGROUND: Short tandem repeats (STRs) compose approximately 3% of the genome, and mutations at STR loci have been linked to dozens of human diseases including amyotrophic lateral sclerosis, Friedreich ataxia, Huntington disease, and fragile X syndrome. Improving our understanding of these mutations would increase our knowledge of the mutational dynamics of the genome and may uncover additional loci that contribute to disease. To estimate the genome-wide pattern of mutations at STR loci, we analyze blood-derived whole-genome sequencing data for 544 individuals from 29 three-generation CEPH pedigrees. These pedigrees contain both sets of grandparents, the parents, and an average of 9 grandchildren per family. RESULTS: We use HipSTR to identify de novo STR mutations in the 2nd generation of these pedigrees and require transmission to the third generation for validation. Analyzing approximately 1.6 million STR loci, we estimate the empirical de novo STR mutation rate to be 5.24 × 10-5 mutations per locus per generation. Perfect repeats mutate about 2 × more often than imperfect repeats. De novo STRs are significantly enriched in Alu elements. CONCLUSIONS: Approximately 30% of new STR mutations occur within Alu elements, which compose only 11% of the genome, but only 10% are found in LINE-1 insertions, which compose 17% of the genome. Phasing these mutations to the parent of origin shows that parental transmission biases vary among families. We estimate the average number of de novo genome-wide STR mutations per individual to be approximately 85, which is similar to the average number of observed de novo single nucleotide variants.
Asunto(s)
Familia Extendida , Repeticiones de Microsatélite , Humanos , Mutación , Linaje , GenomaRESUMEN
OBJECTIVE: To identify novel disease associated loci for amyotrophic lateral sclerosis (ALS), we utilized sequencing data and performed in vitro and in vivo experiments to demonstrate pathogenicity of mutations identified in TP73. METHODS: We analyzed exome sequences of 87 sporadic ALS patients and 324 controls, with confirmatory sequencing in independent ALS cohorts of >2,800 patients. For the top hit, TP73, a regulator of apoptosis, differentiation, and a binding partner as well as homolog of the tumor suppressor gene TP53, we assayed mutation effects using in vitro and in vivo experiments. C2C12 myoblast differentiation assays, characterization of myotube appearance, and immunoprecipitation of p53-p73 complexes were perform in vitro. In vivo, we used CRISPR/Cas9 targeting of zebrafish tp73 to assay motor neuron number and axon morphology. RESULTS: Five heterozygous rare, nonsynonymous mutations in TP73 were identified in our sporadic ALS cohort. In independent ALS cohorts, we identified an additional 19 rare, deleterious variants in TP73. Patient TP73 mutations caused abnormal differentiation and increased apoptosis in the myoblast differentiation assay, with abnormal myotube appearance. Immunoprecipitation of mutant ΔN-p73 demonstrated that patient mutations hinder ΔN-p73's ability to bind p53. CRISPR/Cas9 knockout of tp73 in zebrafish led to impaired motor neuron development and abnormal axonal morphology, concordant with ALS pathology. CONCLUSION: Together, these results strongly suggest that variants in TP73 correlate with risk for ALS and indicate a novel role for apoptosis in ALS disease pathology.
RESUMEN
While mobile elements are largely inactive in healthy somatic tissues, increased activity has been found in cancer tissues, with significant variation among different cancer types. In addition to insertion events, mobile elements have also been found to mediate many structural variation events in the genome. Here, to better understand the timing and impact of mobile element insertions and associated structural variants in cancer, we examined their activity in longitudinal samples of four metastatic breast cancer patients. We identified 11 mobile element insertions or associated structural variants and found that the majority of these occurred early in tumor progression. Most of the variants impact intergenic regions; however, we identified a translocation interrupting MAP2K4 involving Alu elements and a deletion in YTHDF2 involving mobile elements that likely inactivate reported tumor suppressor genes. The high variant allele fraction of the translocation, the loss of the other copy of MAP2K4, the recurrent loss-of-function mutations found in this gene in other cancers, and the important function of MAP2K4 indicate that this translocation is potentially a driver mutation. Overall, using a unique longitudinal dataset, we find that most variants are likely passenger mutations in the four patients we examined, but some variants impact tumor progression.
Asunto(s)
Neoplasias de la Mama/genética , Elementos Transponibles de ADN/genética , Variación Estructural del Genoma , Mutagénesis Insercional/genética , Alelos , Cromosomas Humanos/genética , Femenino , Dosificación de Gen , Humanos , Estudios Longitudinales , MAP Quinasa Quinasa 4/genéticaRESUMEN
Clonal hematopoiesis of indeterminate potential (CHIP) is characterized by detectable hematopoietic-associated gene mutations in a person without evidence of hematologic malignancy. We sought to identify additional cancer-presenting mutations useable for CHIP detection by performing a data mining analysis of 48 somatic mutation studies reporting mutations at diagnoses of 7,430 adult and pediatric patients with hematologic malignancies. Following extraction of 20,141 protein-altering mutations, we identified 434 significantly recurrent mutation hotspots, 364 of which occurred at loci confidently assessable for CHIP. We then performed an additional large-scale analysis of whole exome sequencing data from 4,538 persons belonging to three non-cancer cohorts for clonal mutations. We found the combined cohort prevalence of CHIP with mutations identical to those reported at blood cancer mutation hotspots to be 1.8%, and that some of these CHIP mutations occurred in children. Our findings may help to improve CHIP detection and pre-cancer surveillance for both children and adults.
Asunto(s)
Neoplasias Hematológicas , Neoplasias , Adulto , Niño , Hematopoyesis Clonal , Neoplasias Hematológicas/diagnóstico , Hematopoyesis/genética , Humanos , Mutación , Neoplasias/diagnósticoRESUMEN
Each human genome includes de novo mutations that arose during gametogenesis. While these germline mutations represent a fundamental source of new genetic diversity, they can also create deleterious alleles that impact fitness. Whereas the rate and patterns of point mutations in the human germline are now well understood, far less is known about the frequency and features that impact de novo structural variants (dnSVs). We report a family-based study of germline mutations among 9,599 human genomes from 33 multigenerational CEPH-Utah families and 2,384 families from the Simons Foundation Autism Research Initiative. We find that de novo structural mutations detected by alignment-based, short-read WGS occur at an overall rate of at least 0.160 events per genome in unaffected individuals, and we observe a significantly higher rate (0.206 per genome) in ASD-affected individuals. In both probands and unaffected samples, nearly 73% of de novo structural mutations arose in paternal gametes, and we predict most de novo structural mutations to be caused by mutational mechanisms that do not require sequence homology. After multiple testing correction, we did not observe a statistically significant correlation between parental age and the rate of de novo structural variation in offspring. These results highlight that a spectrum of mutational mechanisms contribute to germline structural mutations and that these mechanisms most likely have markedly different rates and selective pressures than those leading to point mutations.
Asunto(s)
Familia , Genoma Humano/genética , Células Germinativas , Mutación de Línea Germinal/genética , Tasa de Mutación , Envejecimiento/genética , Trastorno Autístico/genética , Sesgo , Variaciones en el Número de Copia de ADN/genética , Análisis Mutacional de ADN , Femenino , Humanos , Masculino , Edad Paterna , Mutación Puntual/genéticaRESUMEN
Ageing may be due to mutation accumulation across the lifespan, leading to tissue dysfunction, disease, and death. We tested whether germline autosomal mutation rates in young adults predict their remaining survival, and, for women, their reproductive lifespans. Age-adjusted mutation rates (AAMRs) in 61 women and 61 men from the Utah CEPH (Centre d'Etude du Polymorphisme Humain) families were determined. Age at death, cause of death, all-site cancer incidence, and reproductive histories were provided by the Utah Population Database, Utah Cancer Registry, and Utah Genetic Reference Project. Higher AAMRs were significantly associated with higher all-cause mortality in both sexes combined. Subjects in the top quartile of AAMRs experienced more than twice the mortality of bottom quartile subjects (hazard ratio [HR], 2.07; 95% confidence interval [CI], 1.21-3.56; p = 0.008; median survival difference = 4.7 years). Fertility analyses were restricted to women whose age at last birth (ALB) was ≥ 30 years, the age when fertility begins to decline. Women with higher AAMRs had significantly fewer live births and a younger ALB. Adult germline mutation accumulation rates are established in adolescence, and later menarche in women is associated with delayed mutation accumulation. We conclude that germline mutation rates in healthy young adults may provide a measure of both reproductive and systemic ageing. Puberty may induce the establishment of adult mutation accumulation rates, just when DNA repair systems begin their lifelong decline.
Asunto(s)
Mutación de Línea Germinal , Longevidad/genética , Tasa de Mutación , Reproducción/genética , Femenino , Fertilidad/genética , Humanos , Nacimiento Vivo , Masculino , Embarazo , Sistema de Registros , Historia Reproductiva , Análisis de Supervivencia , Utah , Adulto JovenRESUMEN
Ongoing retrotransposition of Alu, LINE-1, and SINE-VNTR-Alu elements generates diversity and variation among human populations. Previous analyses investigating the population genetics of mobile element insertions (MEIs) have been limited by population ascertainment bias or by relatively small numbers of populations and low sequencing coverage. Here, we use 296 individuals representing 142 global populations from the Simons Genome Diversity Project (SGDP) to discover and characterize MEI diversity from deeply sequenced whole-genome data. We report 5,742 MEIs not originally reported by the 1000 Genomes Project and show that high sampling diversity leads to a 4- to 7-fold increase in MEI discovery rates over the original 1000 Genomes Project data. As a result of negative selection, nonreference polymorphic MEIs are underrepresented within genes, and MEIs within genes are often found in the transcriptional orientation opposite that of the gene. Globally, 80% of Alu subfamilies predate the expansion of modern humans from Africa. Polymorphic MEIs show heterozygosity gradients that decrease from Africa to Eurasia to the Americas, and the number of MEIs found uniquely in a single individual are also distributed in this general pattern. The maximum fraction of MEI diversity partitioned among the seven major SGDP population groups (FST) is 7.4%, similar to, but slightly lower than, previous estimates and likely attributable to the diverse sampling strategy of the SGDP. Finally, we utilize these MEIs to extrapolate the primary Native American shared ancestry component to back to Asia and provide new evidence from genome-wide identical-by-descent genetic markers that add additional support for a southeastern Siberian origin for most Native Americans.
Asunto(s)
Elementos Alu , Variación Genética , Genoma Humano , Elementos de Nucleótido Esparcido Largo , Humanos , FilogeografíaRESUMEN
African Americans have a wide range of continental genetic ancestry. It is unclear whether racial differences in blood pressure (BP) control are related to ancestral background. The authors analyzed data from the Jackson Heart Study, a cohort exclusively comprised of self-identified African Americans, to assess the association between estimated West African ancestry (WAA) and BP control (systolic and diastolic BP < 140/90 mm Hg). Three nested modified Poisson regression models were used to calculate prevalence ratios for BP control associated with the three upper quartiles, separately, vs the lowest quartile of West African ancestry. The authors analyzed data from 1658 participants with hypertension who reported taking all of their antihypertensive medications in the previous 24 hours. WAA was estimated using 389 ancestry informative markers and categorized into quartiles (Q1: <73.7%, Q2: >73.7%-81.0%, Q3: >81.0%-86.3%, and Q4: >86.3%). The proportion of participants with controlled BP in the lowest-to-highest WAA quartile was 75.2%, 76.1%, 76.6%, and 74.4%. The prevalence ratios (95% CI) for controlled BP comparing Q2, Q3, and Q4 to Q1 of WAA were 1.00 (0.93-1.08), 1.02 (0.94-1.10), and 0.99 (0.91-1.07), respectively. Among African Americans in the Jackson Heart Study taking antihypertensive medication, BP control rates did not differ across quartiles of WAA.
Asunto(s)
Antihipertensivos , Negro o Afroamericano , Hipertensión/etnología , África Occidental , Anciano , Antihipertensivos/uso terapéutico , Presión Sanguínea , Femenino , Humanos , Hipertensión/tratamiento farmacológico , Estudios Longitudinales , Masculino , Persona de Mediana Edad , Estados UnidosRESUMEN
Alu retrotransposons account for more than 10% of the human genome, and insertions of these elements create structural variants segregating in human populations. Such polymorphic Alus are powerful markers to understand population structure, and they represent variants that can greatly impact genome function, including gene expression. Accurate genotyping of Alus and other mobile elements has been challenging. Indeed, we found that Alu genotypes previously called for the 1000 Genomes Project are sometimes erroneous, which poses significant problems for phasing these insertions with other variants that comprise the haplotype. To ameliorate this issue, we introduce a new pipeline - TypeTE - which genotypes Alu insertions from whole-genome sequencing data. Starting from a list of polymorphic Alus, TypeTE identifies the hallmarks (poly-A tail and target site duplication) and orientation of Alu insertions using local re-assembly to reconstruct presence and absence alleles. Genotype likelihoods are then computed after re-mapping sequencing reads to the reconstructed alleles. Using a high-quality set of PCR-based genotyping of >200 loci, we show that TypeTE improves genotype accuracy from 83% to 92% in the 1000 Genomes dataset. TypeTE can be readily adapted to other retrotransposon families and brings a valuable toolbox addition for population genomics.
Asunto(s)
Secuencias Repetitivas Esparcidas/genética , Mutagénesis Insercional/genética , Programas Informáticos , Secuenciación Completa del Genoma/métodos , Bases de Datos Genéticas , Frecuencia de los Genes/genética , Sitios Genéticos , Genética de Población , Genoma Humano , Genotipo , HumanosRESUMEN
Germline mutation rates in humans have been estimated for a variety of mutation types, including single-nucleotide and large structural variants. Here, we directly measure the germline retrotransposition rate for the three active retrotransposon elements: L1, Alu, and SVA. We used three tools for calling mobile element insertions (MEIs) (MELT, RUFUS, and TranSurVeyor) on blood-derived whole-genome sequence (WGS) data from 599 CEPH individuals, comprising 33 three-generation pedigrees. We identified 26 de novo MEIs in 437 births. The retrotransposition rate estimates for Alu elements, one in 40 births, is roughly half the rate estimated using phylogenetic analyses, a difference in magnitude similar to that observed for single-nucleotide variants. The L1 retrotransposition rate is one in 63 births and is within range of previous estimates (1:20-1:200 births). The SVA retrotransposition rate, one in 63 births, is much higher than the previous estimate of one in 900 births. Our large, three-generation pedigrees allowed us to assess parent-of-origin effects and the timing of insertion events in either gametogenesis or early embryonic development. We find a statistically significant paternal bias in Alu retrotransposition. Our study represents the first in-depth analysis of the rate and dynamics of human retrotransposition from WGS data in three-generation human pedigrees.
Asunto(s)
Secuencias Repetitivas Esparcidas/genética , Filogenia , Retroelementos/genética , Secuenciación Completa del Genoma , Elementos Alu/genética , Animales , Femenino , Hominidae/sangre , Hominidae/genética , Humanos , Elementos de Nucleótido Esparcido Largo/genética , Masculino , Mutación , Linaje , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
The number of de novo mutations (DNMs) found in an offspring's genome increases with both paternal and maternal age. But does the rate of mutation accumulation in human gametes differ across families? Using sequencing data from 33 large, three-generation CEPH families, we observed significant variability in parental age effects on DNM counts across families, ranging from 0.19 to 3.24 DNMs per year. Additionally, we found that ~3% of DNMs originated following primordial germ cell specification in a parent, and differed from non-mosaic germline DNMs in their mutational spectra. We also discovered that nearly 10% of candidate DNMs in the second generation were post-zygotic, and present in both somatic and germ cells; these gonosomal mutations occurred at equivalent frequencies on both parental haplotypes. Our results demonstrate that rates of germline mutation accumulation vary among families with similar ancestry, and confirm that post-zygotic mosaicism is a substantial source of human DNM.
Asunto(s)
Salud de la Familia , Mutación de Línea Germinal , Mosaicismo , Acumulación de Mutaciones , Humanos , Tasa de Mutación , Análisis de Secuencia de ADN , UtahRESUMEN
Importance: Genome-wide association studies (GWASs) in European populations have identified more than 100 schizophrenia-associated loci. A schizophrenia GWAS in a unique Indian population offers novel findings. Objective: To discover and functionally evaluate genetic loci for schizophrenia in a GWAS of a unique Indian population. Design, Setting, and Participants: This GWAS included a sample of affected individuals, family members, and unrelated cases and controls. Three thousand ninety-two individuals were recruited and diagnostically ascertained via medical records, hospitals, clinics, and clinical networks in Chennai and surrounding regions. Affected participants fulfilled DSM-IV diagnostic criteria for schizophrenia. Unrelated control participants had no personal or family history of psychotic disorder. Recruitment, genotyping, and analysis occurred in consecutive phases beginning January 1, 2001. Recruitment was completed on February 28, 2018, and genotyping and analysis are ongoing. Main Outcomes and Measures: Associations of single-nucleotide polymorphisms and gene expression with schizophrenia. Results: The study population included 1321 participants with schizophrenia, 885 family controls, and 886 unrelated controls. Among participants with schizophrenia, mean (SD) age was 39.1 (11.4) years, and 52.7% were male. This sample demonstrated uniform ethnicity, a degree of inbreeding, and negligible rates of substance abuse. A novel genome-wide significant association was observed between schizophrenia and a chromosome 8q24.3 locus (rs10866912, allele A; odds ratio [OR], 1.27 [95% CI, 1.17-1.38]; P = 4.35 × 10-8) that attracted support in the schizophrenia Psychiatric Genomics Consortium 2 data (rs10866912, allele A; OR, 1.04 [95% CI, 1.02-1.06]; P = 7.56 × 10-4). This locus has undergone natural selection, with the risk allele A declining in frequency from India (approximately 72%) to Europe (approximately 43%). rs10866912 directly modifies the abundance of the nicotinate phosphoribosyltransferase gene (NAPRT1) transcript in brain cortex (normalized effect size, 0.79; 95% CI, 0.6-1.0; P = 5.8 × 10-13). NAPRT1 encodes a key enzyme for niacin metabolism. In Indian lymphoblastoid cell lines, (risk) allele A of rs10866912 was associated with NAPRT1 downregulation (AA: 0.74, n = 21; CC: 1.56, n = 17; P = .004). Preliminary zebrafish data further suggest that partial loss of function of NAPRT1 leads to abnormal brain development. Conclusions and Relevance: Bioinformatic analyses and cellular and zebrafish gene expression studies implicate NAPRT1 as a novel susceptibility gene. Given this gene's role in niacin metabolism and the evidence for niacin deficiency provoking schizophrenialike symptoms in neuropsychiatric diseases such as pellagra and Hartnup disease, these results suggest that the rs10866912 genotype and niacin status may have implications for schizophrenia susceptibility and treatment.
Asunto(s)
Cromosomas Humanos Par 8/genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Niacina/metabolismo , Pentosiltransferasa/genética , Esquizofrenia/genética , Adulto , Animales , Estudios de Casos y Controles , Línea Celular Tumoral , Modelos Animales de Enfermedad , Familia , Femenino , Técnicas Genéticas , Humanos , India , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Pez CebraRESUMEN
The textbook view that most germline mutations in mammals arise from replication errors is indirectly supported by the fact that there are both more mutations and more cell divisions in the male than in the female germline. When analyzing large de novo mutation datasets in humans, we find multiple lines of evidence that call that view into question. Notably, despite the drastic increase in the ratio of male to female germ cell divisions after the onset of spermatogenesis, even young fathers contribute three times more mutations than young mothers, and this ratio barely increases with parental age. This surprising finding points to a substantial contribution of damage-induced mutations. Indeed, C-to-G transversions and CpG transitions, which together constitute over one-fourth of all base substitution mutations, show genomic distributions and sex-specific age dependencies indicative of double-strand break repair and methylation-associated damage, respectively. Moreover, we find evidence that maternal age at conception influences the mutation rate both because of the accumulation of damage in oocytes and potentially through an influence on the number of postzygotic mutations in the embryo. These findings reveal underappreciated roles of DNA damage and maternal age in the genesis of human germline mutations.
Asunto(s)
Roturas del ADN de Doble Cadena , Reparación del ADN , Bases de Datos de Ácidos Nucleicos , Mutación de Línea Germinal , Edad Materna , Adolescente , Adulto , Femenino , Humanos , Masculino , Persona de Mediana Edad , Oocitos , Embarazo , Espermatogénesis/genéticaRESUMEN
Crohn's disease is a complex genetic trait characterized by chronic relapsing intestinal inflammation. Genome wide association studies (GWAS) have identified more than 170 loci associated with the disease, accounting for â¼14% of the disease variance. We hypothesized that rare genetic variation in GWAS positional candidates also contribute to disease pathogenesis. We performed targeted, massively-parallel sequencing of 101 genes in 205 children with Crohn's disease, including 179 parent-child trios and 200 controls, both of European ancestry. We used the gene burden test implemented in VAAST and estimated effect sizes using logistic regression and meta-analyses. We identified three genes with nominally significant p-values: NOD2, RTKN2, and MGAT3 Only NOD2 was significant after correcting for multiple comparisons. We identified eight novel rare variants in NOD2 that are likely disease-associated. Incorporation of rare variation and compound heterozygosity nominally increased the proportion of variance explained from 0.074 to 0.089. We estimated the population attributable risk and total heritability of variation in NOD2 to be 32.9% and 3.4%, respectively, with 3.7% and 0.25% accounted for by rare putatively functional variants. Sequencing probands (as opposed to genotyping) to identify rare variants and incorporating phase by sequencing parents can recover a portion of the missing heritability of Crohn's disease.