RESUMEN
DNA rearrangements resulting in human genome structural variants (SVs) are caused by diverse mutational mechanisms. We used long- and short-read sequencing technologies to investigate end products of de novo chromosome 17p11.2 rearrangements and query the molecular mechanisms underlying both recurrent and non-recurrent events. Evidence for an increased rate of clustered single-nucleotide variant (SNV) mutation in cis with non-recurrent rearrangements was found. Indel and SNV formation are associated with both copy-number gains and losses of 17p11.2, occur up to â¼1 Mb away from the breakpoint junctions, and favor C > G transversion substitutions; results suggest that single-stranded DNA is formed during the genesis of the SV and provide compelling support for a microhomology-mediated break-induced replication (MMBIR) mechanism for SV formation. Our data show an additional mutational burden of MMBIR consisting of hypermutation confined to the locus and manifesting as SNVs and indels predominantly within genes.
Asunto(s)
Cromosomas Humanos Par 17 , Mutación , Anomalías Múltiples/genética , Puntos de Rotura del Cromosoma , Trastornos de los Cromosomas/genética , Duplicación Cromosómica/genética , Variaciones en el Número de Copia de ADN , Reparación del ADN/genética , Replicación del ADN , Reordenamiento Génico , Genoma Humano , Variación Estructural del Genoma , Humanos , Mutación INDEL , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Recombinación Genética , Análisis de Secuencia de ADN/métodos , Síndrome de Smith-Magenis/genéticaRESUMEN
Diversity in the genetic lesions that cause cancer is extreme. In consequence, a pressing challenge is the development of drugs that target patient-specific disease mechanisms. To address this challenge, we employed a chemistry-first discovery paradigm for de novo identification of druggable targets linked to robust patient selection hypotheses. In particular, a 200,000 compound diversity-oriented chemical library was profiled across a heavily annotated test-bed of >100 cellular models representative of the diverse and characteristic somatic lesions for lung cancer. This approach led to the delineation of 171 chemical-genetic associations, shedding light on the targetability of mechanistic vulnerabilities corresponding to a range of oncogenotypes present in patient populations lacking effective therapy. Chemically addressable addictions to ciliogenesis in TTC21B mutants and GLUT8-dependent serine biosynthesis in KRAS/KEAP1 double mutants are prominent examples. These observations indicate a wealth of actionable opportunities within the complex molecular etiology of cancer.
Asunto(s)
Carcinoma de Pulmón de Células no Pequeñas/patología , Proliferación Celular/efectos de los fármacos , Neoplasias Pulmonares/patología , Bibliotecas de Moléculas Pequeñas/farmacología , Carcinoma de Pulmón de Células no Pequeñas/metabolismo , Línea Celular Tumoral , Familia 4 del Citocromo P450/deficiencia , Familia 4 del Citocromo P450/genética , Descubrimiento de Drogas , Puntos de Control de la Fase G1 del Ciclo Celular/efectos de los fármacos , Glucocorticoides/farmacología , Proteínas Facilitadoras del Transporte de la Glucosa/antagonistas & inhibidores , Proteínas Facilitadoras del Transporte de la Glucosa/genética , Proteínas Facilitadoras del Transporte de la Glucosa/metabolismo , Humanos , Proteína 1 Asociada A ECH Tipo Kelch/genética , Proteína 1 Asociada A ECH Tipo Kelch/metabolismo , Neoplasias Pulmonares/metabolismo , Proteínas Asociadas a Microtúbulos/genética , Proteínas Asociadas a Microtúbulos/metabolismo , Mutación , Factor 2 Relacionado con NF-E2/antagonistas & inhibidores , Factor 2 Relacionado con NF-E2/genética , Factor 2 Relacionado con NF-E2/metabolismo , Proteínas Proto-Oncogénicas p21(ras)/genética , Proteínas Proto-Oncogénicas p21(ras)/metabolismo , Interferencia de ARN , ARN Interferente Pequeño/metabolismo , Receptor Notch2/genética , Receptor Notch2/metabolismo , Receptores de Glucocorticoides/antagonistas & inhibidores , Receptores de Glucocorticoides/genética , Receptores de Glucocorticoides/metabolismo , Bibliotecas de Moléculas Pequeñas/química , Bibliotecas de Moléculas Pequeñas/metabolismoRESUMEN
De novo copy number variants (dnCNVs) arising at multiple loci in a personal genome have usually been considered to reflect cancer somatic genomic instabilities. We describe a multiple dnCNV (MdnCNV) phenomenon in which individuals with genomic disorders carry five to ten constitutional dnCNVs. These CNVs originate from independent formation incidences, are predominantly tandem duplications or complex gains, exhibit breakpoint junction features reminiscent of replicative repair, and show increased de novo point mutations flanking the rearrangement junctions. The active CNV mutation shower appears to be restricted to a transient perizygotic period. We propose that a defect in the CNV formation process is responsible for the "CNV-mutator state," and this state is dampened after early embryogenesis. The constitutional MdnCNV phenomenon resembles chromosomal instability in various cancers. Investigations of this phenomenon may provide unique access to understanding genomic disorders, structural variant mutagenesis, human evolution, and cancer biology.
Asunto(s)
Aberraciones Cromosómicas , Variaciones en el Número de Copia de ADN , Enfermedades Genéticas Congénitas/embriología , Enfermedades Genéticas Congénitas/genética , Inestabilidad Genómica , Mutación , Puntos de Rotura del Cromosoma , Duplicación Cromosómica , Replicación del ADN , Desarrollo Embrionario , Femenino , Gametogénesis , Humanos , MasculinoRESUMEN
Invertebrate model systems are powerful tools for studying human disease owing to their genetic tractability and ease of screening. We conducted a mosaic genetic screen of lethal mutations on the Drosophila X chromosome to identify genes required for the development, function, and maintenance of the nervous system. We identified 165 genes, most of whose function has not been studied in vivo. In parallel, we investigated rare variant alleles in 1,929 human exomes from families with unsolved Mendelian disease. Genes that are essential in flies and have multiple human homologs were found to be likely to be associated with human diseases. Merging the human data sets with the fly genes allowed us to identify disease-associated mutations in six families and to provide insights into microcephaly associated with brain dysgenesis. This bidirectional synergism between fly genetics and human genomics facilitates the functional annotation of evolutionarily conserved genes involved in human health.
Asunto(s)
Enfermedad/genética , Drosophila melanogaster/genética , Pruebas Genéticas , Patrón de Herencia , Interferencia de ARN , Animales , Modelos Animales de Enfermedad , Humanos , Cromosoma XRESUMEN
CLP1 is a RNA kinase involved in tRNA splicing. Recently, CLP1 kinase-dead mice were shown to display a neuromuscular disorder with loss of motor neurons and muscle paralysis. Human genome analyses now identified a CLP1 homozygous missense mutation (p.R140H) in five unrelated families, leading to a loss of CLP1 interaction with the tRNA splicing endonuclease (TSEN) complex, largely reduced pre-tRNA cleavage activity, and accumulation of linear tRNA introns. The affected individuals develop severe motor-sensory defects, cortical dysgenesis, and microcephaly. Mice carrying kinase-dead CLP1 also displayed microcephaly and reduced cortical brain volume due to the enhanced cell death of neuronal progenitors that is associated with reduced numbers of cortical neurons. Our data elucidate a neurological syndrome defined by CLP1 mutations that impair tRNA splicing. Reduction of a founder mutation to homozygosity illustrates the importance of rare variations in disease and supports the clan genomics hypothesis.
Asunto(s)
Enfermedades del Sistema Nervioso Central/genética , Mutación Missense , Proteínas Nucleares/metabolismo , Enfermedades del Sistema Nervioso Periférico/genética , Fosfotransferasas/metabolismo , ARN de Transferencia/metabolismo , Factores de Transcripción/metabolismo , Anomalías Múltiples/genética , Anomalías Múltiples/patología , Animales , Enfermedades del Sistema Nervioso Central/patología , Cerebro/patología , Preescolar , Endorribonucleasas/metabolismo , Femenino , Fibroblastos/metabolismo , Humanos , Lactante , Masculino , Ratones , Ratones Endogámicos CBA , Microcefalia/genética , Enfermedades del Sistema Nervioso Periférico/patología , ARN de Transferencia/genética , Proteínas de Unión al ARNRESUMEN
Ion channel mutations are an important cause of rare Mendelian disorders affecting brain, heart, and other tissues. We performed parallel exome sequencing of 237 channel genes in a well-characterized human sample, comparing variant profiles of unaffected individuals to those with the most common neuronal excitability disorder, sporadic idiopathic epilepsy. Rare missense variation in known Mendelian disease genes is prevalent in both groups at similar complexity, revealing that even deleterious ion channel mutations confer uncertain risk to an individual depending on the other variants with which they are combined. Our findings indicate that variant discovery via large scale sequencing efforts is only a first step in illuminating the complex allelic architecture underlying personal disease risk. We propose that in silico modeling of channel variation in realistic cell and network models will be crucial to future strategies assessing mutation profile pathogenicity and drug response in individuals with a broad spectrum of excitability disorders.
Asunto(s)
Epilepsia/genética , Perfilación de la Expresión Génica , Canales Iónicos/genética , Polimorfismo de Nucleótido Simple , Simulación por Computador , Epistasis Genética , Hipocampo/metabolismo , Humanos , Mutación Missense , Neuronas/metabolismo , Medición de RiesgoRESUMEN
Among breast cancers, triple-negative breast cancer (TNBC) is the most poorly understood and is refractory to current targeted therapies. Using a genetic screen, we identify the PTPN12 tyrosine phosphatase as a tumor suppressor in TNBC. PTPN12 potently suppresses mammary epithelial cell proliferation and transformation. PTPN12 is frequently compromised in human TNBCs, and we identify an upstream tumor-suppressor network that posttranscriptionally controls PTPN12. PTPN12 suppresses transformation by interacting with and inhibiting multiple oncogenic tyrosine kinases, including HER2 and EGFR. The tumorigenic and metastatic potential of PTPN12-deficient TNBC cells is severely impaired upon restoration of PTPN12 function or combined inhibition of PTPN12-regulated tyrosine kinases, suggesting that TNBCs are dependent on the proto-oncogenic tyrosine kinases constrained by PTPN12. Collectively, these data identify PTPN12 as a commonly inactivated tumor suppressor and provide a rationale for combinatorially targeting proto-oncogenic tyrosine kinases in TNBC and other cancers based on their profile of tyrosine-phosphatase activity.
Asunto(s)
Neoplasias de la Mama/metabolismo , Proteína Tirosina Fosfatasa no Receptora Tipo 12/genética , Proteína Tirosina Fosfatasa no Receptora Tipo 12/metabolismo , Proteínas Supresoras de Tumor/metabolismo , Neoplasias de la Mama/tratamiento farmacológico , Línea Celular Tumoral , Transformación Celular Neoplásica , Receptores ErbB/metabolismo , Femenino , Regulación Neoplásica de la Expresión Génica , Humanos , Sistema de Señalización de MAP Quinasas , MicroARNs/metabolismo , Mutación , Metástasis de la Neoplasia , Procesamiento Proteico-PostraduccionalRESUMEN
A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline1 to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing.
Asunto(s)
Variación Genética , Genoma Humano/genética , Secuenciación Completa del Genoma , Alelos , Estudios de Casos y Controles , Epigénesis Genética , Femenino , Dosificación de Gen/genética , Genética de Población , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Anotación de Secuencia Molecular , Sitios de Carácter Cuantitativo , Grupos Raciales/genética , Programas InformáticosRESUMEN
The African continent is regarded as the cradle of modern humans and African genomes contain more genetic variation than those from any other continent, yet only a fraction of the genetic diversity among African individuals has been surveyed1. Here we performed whole-genome sequencing analyses of 426 individuals-comprising 50 ethnolinguistic groups, including previously unsampled populations-to explore the breadth of genomic diversity across Africa. We uncovered more than 3 million previously undescribed variants, most of which were found among individuals from newly sampled ethnolinguistic groups, as well as 62 previously unreported loci that are under strong selection, which were predominantly found in genes that are involved in viral immunity, DNA repair and metabolism. We observed complex patterns of ancestral admixture and putative-damaging and novel variation, both within and between populations, alongside evidence that Zambia was a likely intermediate site along the routes of expansion of Bantu-speaking populations. Pathogenic variants in genes that are currently characterized as medically relevant were uncommon-but in other genes, variants denoted as 'likely pathogenic' in the ClinVar database were commonly observed. Collectively, these findings refine our current understanding of continental migration, identify gene flow and the response to human disease as strong drivers of genome-level population variation, and underscore the scientific imperative for a broader characterization of the genomic diversity of African individuals to understand human ancestry and improve health.
Asunto(s)
Variación Genética , Genoma Humano/genética , Genómica , Salud , Migración Humana , África/etnología , Reparación del ADN/genética , Conjuntos de Datos como Asunto , Femenino , Flujo Génico , Genética Médica , Genética de Población , Salud/historia , Historia Antigua , Migración Humana/historia , Humanos , Inmunidad/genética , Lenguaje , Masculino , Metabolismo/genética , Selección Genética , Secuenciación Completa del GenomaRESUMEN
Homozygous duplications contribute to genetic disease by altering gene dosage or disrupting gene regulation and can be more deleterious to organismal biology than heterozygous duplications. Intragenic exonic duplications can result in loss-of-function (LoF) or gain-of-function (GoF) alleles that when homozygosed, i.e. brought to homozygous state at a locus by identity by descent or state, could potentially result in autosomal recessive (AR) rare disease traits. However, the detection and functional interpretation of homozygous duplications from exome sequencing data remains a challenge. We developed a framework algorithm, HMZDupFinder, that is designed to detect exonic homozygous duplications from exome sequencing (ES) data. The HMZDupFinder algorithm can efficiently process large datasets and accurately identifies small intragenic duplications, including those associated with rare disease traits. HMZDupFinder called 965 homozygous duplications with three or less exons from 8,707 ES with a recall rate of 70.9% and a precision of 16.1%. We experimentally confirmed 8/10 rare homozygous duplications. Pathogenicity assessment of these copy number variant alleles allowed clinical genomics contextualization for three homozygous duplications alleles, including two affecting known OMIM disease genes EDAR (MIM# 224900), TNNT1(MIM# 605355), and one variant in a novel candidate disease gene: PAAF1.
Asunto(s)
Variaciones en el Número de Copia de ADN , Secuenciación del Exoma , Programas Informáticos , Humanos , Proteínas Adaptadoras Transductoras de Señales , Homocigoto , Enfermedades Raras/genéticaRESUMEN
BACKGROUND: Kinesin motor proteins transport intracellular cargo, including mRNA, proteins, and organelles. Pathogenic variants in kinesin-related genes have been implicated in neurodevelopmental disorders and skeletal dysplasias. We identified de novo, heterozygous variants in KIF5B, encoding a kinesin-1 subunit, in four individuals with osteogenesis imperfecta. The variants cluster within the highly conserved kinesin motor domain and are predicted to interfere with nucleotide binding, although the mechanistic consequences on cell signaling and function are unknown. METHODS: To understand the in vivo genetic mechanism of KIF5B variants, we modeled the p.Thr87Ile variant that was found in two patients in the C. elegans ortholog, unc-116, at the corresponding position (Thr90Ile) by CRISPR/Cas9 editing and performed functional analysis. Next, we studied the cellular and molecular consequences of the recurrent p.Thr87Ile variant by microscopy, RNA and protein analysis in NIH3T3 cells, primary human fibroblasts and bone biopsy. RESULTS: C. elegans heterozygous for the unc-116 Thr90Ile variant displayed abnormal body length and motility phenotypes that were suppressed by additional copies of the wild type allele, consistent with a dominant negative mechanism. Time-lapse imaging of GFP-tagged mitochondria showed defective mitochondria transport in unc-116 Thr90Ile neurons providing strong evidence for disrupted kinesin motor function. Microscopy studies in human cells showed dilated endoplasmic reticulum, multiple intracellular vacuoles, and abnormal distribution of the Golgi complex, supporting an intracellular trafficking defect. RNA sequencing, proteomic analysis, and bone immunohistochemistry demonstrated down regulation of the mTOR signaling pathway that was partially rescued with leucine supplementation in patient cells. CONCLUSION: We report dominant negative variants in the KIF5B kinesin motor domain in individuals with osteogenesis imperfecta. This study expands the spectrum of kinesin-related disorders and identifies dysregulated signaling targets for KIF5B in skeletal development.
Asunto(s)
Cinesinas , Osteogénesis Imperfecta , Animales , Humanos , Ratones , Caenorhabditis elegans/genética , Caenorhabditis elegans/metabolismo , Proteínas Portadoras/genética , Regulación hacia Abajo , Cinesinas/genética , Cinesinas/metabolismo , Células 3T3 NIH , Proteómica , Transducción de Señal/genética , Serina-Treonina Quinasas TOR/genética , Serina-Treonina Quinasas TOR/metabolismoRESUMEN
While polygenic risk scores (PRSs) enable early identification of genetic risk for chronic obstructive pulmonary disease (COPD), predictive performance is limited when the discovery and target populations are not well matched. Hypothesizing that the biological mechanisms of disease are shared across ancestry groups, we introduce a PrediXcan-derived polygenic transcriptome risk score (PTRS) to improve cross-ethnic portability of risk prediction. We constructed the PTRS using summary statistics from application of PrediXcan on large-scale GWASs of lung function (forced expiratory volume in 1 s [FEV1] and its ratio to forced vital capacity [FEV1/FVC]) in the UK Biobank. We examined prediction performance and cross-ethnic portability of PTRS through smoking-stratified analyses both on 29,381 multi-ethnic participants from TOPMed population/family-based cohorts and on 11,771 multi-ethnic participants from TOPMed COPD-enriched studies. Analyses were carried out for two dichotomous COPD traits (moderate-to-severe and severe COPD) and two quantitative lung function traits (FEV1 and FEV1/FVC). While the proposed PTRS showed weaker associations with disease than PRS for European ancestry, the PTRS showed stronger association with COPD than PRS for African Americans (e.g., odds ratio [OR] = 1.24 [95% confidence interval [CI]: 1.08-1.43] for PTRS versus 1.10 [0.96-1.26] for PRS among heavy smokers with ≥ 40 pack-years of smoking) for moderate-to-severe COPD. Cross-ethnic portability of the PTRS was significantly higher than the PRS (paired t test p < 2.2 × 10-16 with portability gains ranging from 5% to 28%) for both dichotomous COPD traits and across all smoking strata. Our study demonstrates the value of PTRS for improved cross-ethnic portability compared to PRS in predicting COPD risk.
Asunto(s)
Enfermedad Pulmonar Obstructiva Crónica , Transcriptoma , Humanos , Pulmón , National Heart, Lung, and Blood Institute (U.S.) , Enfermedad Pulmonar Obstructiva Crónica/genética , Factores de Riesgo , Estados Unidos/epidemiologíaRESUMEN
Plasma levels of fibrinogen, coagulation factors VII and VIII and von Willebrand factor (vWF) are four intermediate phenotypes that are heritable and have been associated with the risk of clinical thrombotic events. To identify rare and low-frequency variants associated with these hemostatic factors, we conducted whole-exome sequencing in 10 860 individuals of European ancestry (EA) and 3529 African Americans (AAs) from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium and the National Heart, Lung and Blood Institute's Exome Sequencing Project. Gene-based tests demonstrated significant associations with rare variation (minor allele frequency < 5%) in fibrinogen gamma chain (FGG) (with fibrinogen, P = 9.1 × 10-13), coagulation factor VII (F7) (with factor VII, P = 1.3 × 10-72; seven novel variants) and VWF (with factor VIII and vWF; P = 3.2 × 10-14; one novel variant). These eight novel rare variant associations were independent of the known common variants at these loci and tended to have much larger effect sizes. In addition, one of the rare novel variants in F7 was significantly associated with an increased risk of venous thromboembolism in AAs (Ile200Ser; rs141219108; P = 4.2 × 10-5). After restricting gene-based analyses to only loss-of-function variants, a novel significant association was detected and replicated between factor VIII levels and a stop-gain mutation exclusive to AAs (rs3211938) in CD36 molecule (CD36). This variant has previously been linked to dyslipidemia but not with the levels of a hemostatic factor. These efforts represent the largest integration of whole-exome sequence data from two national projects to identify genetic variation associated with plasma hemostatic factors.
Asunto(s)
Factor VIII , Hemostáticos , Factor VII/genética , Factor VIII/genética , Fibrinógeno/genética , Humanos , Polimorfismo de Nucleótido Simple/genética , Secuenciación del Exoma , Factor de von Willebrand/análisis , Factor de von Willebrand/genéticaRESUMEN
Coatomer complexes function in the sorting and trafficking of proteins between subcellular organelles. Pathogenic variants in coatomer subunits or associated factors have been reported in multi-systemic disorders, i.e., coatopathies, that can affect the skeletal and central nervous systems. We have identified loss-of-function variants in COPB2, a component of the coatomer complex I (COPI), in individuals presenting with osteoporosis, fractures, and developmental delay of variable severity. Electron microscopy of COPB2-deficient subjects' fibroblasts showed dilated endoplasmic reticulum (ER) with granular material, prominent rough ER, and vacuoles, consistent with an intracellular trafficking defect. We studied the effect of COPB2 deficiency on collagen trafficking because of the critical role of collagen secretion in bone biology. COPB2 siRNA-treated fibroblasts showed delayed collagen secretion with retention of type I collagen in the ER and Golgi and altered distribution of Golgi markers. copb2-null zebrafish embryos showed retention of type II collagen, disorganization of the ER and Golgi, and early larval lethality. Copb2+/- mice exhibited low bone mass, and consistent with the findings in human cells and zebrafish, studies in Copb2+/- mouse fibroblasts suggest ER stress and a Golgi defect. Interestingly, ascorbic acid treatment partially rescued the zebrafish developmental phenotype and the cellular phenotype in Copb2+/- mouse fibroblasts. This work identifies a form of coatopathy due to COPB2 haploinsufficiency, explores a potential therapeutic approach for this disorder, and highlights the role of the COPI complex as a regulator of skeletal homeostasis.
Asunto(s)
Huesos/metabolismo , Proteína Coat de Complejo I/genética , Proteína Coatómero/genética , Discapacidades del Desarrollo/genética , Discapacidad Intelectual/genética , Osteoporosis/genética , Animales , Ácido Ascórbico/farmacología , Huesos/efectos de los fármacos , Huesos/patología , Encéfalo/diagnóstico por imagen , Encéfalo/efectos de los fármacos , Encéfalo/metabolismo , Encéfalo/patología , Niño , Preescolar , Proteína Coat de Complejo I/deficiencia , Proteína Coatómero/química , Proteína Coatómero/deficiencia , Colágeno Tipo I/genética , Colágeno Tipo I/metabolismo , Discapacidades del Desarrollo/diagnóstico por imagen , Discapacidades del Desarrollo/metabolismo , Discapacidades del Desarrollo/patología , Embrión no Mamífero , Retículo Endoplásmico/efectos de los fármacos , Retículo Endoplásmico/metabolismo , Retículo Endoplásmico/patología , Femenino , Fibroblastos/efectos de los fármacos , Fibroblastos/metabolismo , Fibroblastos/patología , Regulación del Desarrollo de la Expresión Génica , Aparato de Golgi , Haploinsuficiencia , Humanos , Discapacidad Intelectual/diagnóstico por imagen , Discapacidad Intelectual/metabolismo , Discapacidad Intelectual/patología , Masculino , Ratones , Osteoporosis/tratamiento farmacológico , Osteoporosis/metabolismo , Osteoporosis/patología , ARN Interferente Pequeño/genética , ARN Interferente Pequeño/metabolismo , Índice de Severidad de la Enfermedad , Pez CebraRESUMEN
Neurodevelopmental disorders (NDDs) are clinically and genetically heterogenous; many such disorders are secondary to perturbation in brain development and/or function. The prevalence of NDDs is > 3%, resulting in significant sociocultural and economic challenges to society. With recent advances in family-based genomics, rare-variant analyses, and further exploration of the Clan Genomics hypothesis, there has been a logarithmic explosion in neurogenetic "disease-associated genes" molecular etiology and biology of NDDs; however, the majority of NDDs remain molecularly undiagnosed. We applied genome-wide screening technologies, including exome sequencing (ES) and whole-genome sequencing (WGS), to identify the molecular etiology of 234 newly enrolled subjects and 20 previously unsolved Turkish NDD families. In 176 of the 234 studied families (75.2%), a plausible and genetically parsimonious molecular etiology was identified. Out of 176 solved families, deleterious variants were identified in 218 distinct genes, further documenting the enormous genetic heterogeneity and diverse perturbations in human biology underlying NDDs. We propose 86 candidate disease-trait-associated genes for an NDD phenotype. Importantly, on the basis of objective and internally established variant prioritization criteria, we identified 51 families (51/176 = 28.9%) with multilocus pathogenic variation (MPV), mostly driven by runs of homozygosity (ROHs) - reflecting genomic segments/haplotypes that are identical-by-descent. Furthermore, with the use of additional bioinformatic tools and expansion of ES to additional family members, we established a molecular diagnosis in 5 out of 20 families (25%) who remained undiagnosed in our previously studied NDD cohort emanating from Turkey.
Asunto(s)
Genómica/métodos , Mutación , Trastornos del Neurodesarrollo/epidemiología , Fenotipo , Adolescente , Adulto , Niño , Preescolar , Estudios de Cohortes , Femenino , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , Trastornos del Neurodesarrollo/genética , Trastornos del Neurodesarrollo/patología , Linaje , Prevalencia , Turquía/epidemiología , Secuenciación del Exoma , Adulto JovenRESUMEN
The development of the microbiome from infancy to childhood is dependent on a range of factors, with microbial-immune crosstalk during this time thought to be involved in the pathobiology of later life diseases1-9 such as persistent islet autoimmunity and type 1 diabetes10-12. However, to our knowledge, no studies have performed extensive characterization of the microbiome in early life in a large, multi-centre population. Here we analyse longitudinal stool samples from 903 children between 3 and 46 months of age by 16S rRNA gene sequencing (n = 12,005) and metagenomic sequencing (n = 10,867), as part of the The Environmental Determinants of Diabetes in the Young (TEDDY) study. We show that the developing gut microbiome undergoes three distinct phases of microbiome progression: a developmental phase (months 3-14), a transitional phase (months 15-30), and a stable phase (months 31-46). Receipt of breast milk, either exclusive or partial, was the most significant factor associated with the microbiome structure. Breastfeeding was associated with higher levels of Bifidobacterium species (B. breve and B. bifidum), and the cessation of breast milk resulted in faster maturation of the gut microbiome, as marked by the phylum Firmicutes. Birth mode was also significantly associated with the microbiome during the developmental phase, driven by higher levels of Bacteroides species (particularly B. fragilis) in infants delivered vaginally. Bacteroides was also associated with increased gut diversity and faster maturation, regardless of the birth mode. Environmental factors including geographical location and household exposures (such as siblings and furry pets) also represented important covariates. A nested case-control analysis revealed subtle associations between microbial taxonomy and the development of islet autoimmunity or type 1 diabetes. These data determine the structural and functional assembly of the microbiome in early life and provide a foundation for targeted mechanistic investigation into the consequences of microbial-immune crosstalk for long-term health.
Asunto(s)
Microbioma Gastrointestinal/inmunología , Microbioma Gastrointestinal/fisiología , Encuestas y Cuestionarios , Adolescente , Animales , Bifidobacterium/clasificación , Bifidobacterium/genética , Bifidobacterium/aislamiento & purificación , Lactancia Materna/estadística & datos numéricos , Estudios de Casos y Controles , Niño , Preescolar , Análisis por Conglomerados , Conjuntos de Datos como Asunto , Diabetes Mellitus Tipo 1/inmunología , Diabetes Mellitus Tipo 1/microbiología , Femenino , Firmicutes/clasificación , Firmicutes/genética , Firmicutes/aislamiento & purificación , Microbioma Gastrointestinal/genética , Humanos , Lactante , Masculino , Leche Humana/inmunología , Leche Humana/microbiología , Mascotas , ARN Ribosómico 16S/genética , Hermanos , Factores de TiempoRESUMEN
In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3-4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS.
Asunto(s)
Síndrome de Inmunodeficiencia Adquirida/genética , Cercocebus atys/genética , Cercocebus atys/virología , Predisposición Genética a la Enfermedad , Genoma/genética , Especificidad del Huésped/genética , Virus de la Inmunodeficiencia de los Simios , Síndrome de Inmunodeficiencia Adquirida/virología , Secuencia de Aminoácidos , Animales , Moléculas de Adhesión Celular/química , Moléculas de Adhesión Celular/genética , Moléculas de Adhesión Celular/metabolismo , Cercocebus atys/inmunología , Exones/genética , Femenino , Mutación del Sistema de Lectura/genética , Variación Genética , Genómica , VIH/patogenicidad , Humanos , Macaca/virología , Eliminación de Secuencia , Síndrome de Inmunodeficiencia Adquirida del Simio/genética , Síndrome de Inmunodeficiencia Adquirida del Simio/virología , Virus de la Inmunodeficiencia de los Simios/patogenicidad , Especificidad de la Especie , Receptor Toll-Like 4/química , Receptor Toll-Like 4/genética , Receptor Toll-Like 4/inmunología , Transcriptoma/genética , Secuenciación Completa del GenomaRESUMEN
Whole-genome sequencing (WGS) can improve assessment of low-frequency and rare variants, particularly in non-European populations that have been underrepresented in existing genomic studies. The genetic determinants of C-reactive protein (CRP), a biomarker of chronic inflammation, have been extensively studied, with existing genome-wide association studies (GWASs) conducted in >200,000 individuals of European ancestry. In order to discover novel loci associated with CRP levels, we examined a multi-ancestry population (n = 23,279) with WGS (â¼38× coverage) from the Trans-Omics for Precision Medicine (TOPMed) program. We found evidence for eight distinct associations at the CRP locus, including two variants that have not been identified previously (rs11265259 and rs181704186), both of which are non-coding and more common in individuals of African ancestry (â¼10% and â¼1% minor allele frequency, respectively, and rare or monomorphic in 1000 Genomes populations of East Asian, South Asian, and European ancestry). We show that the minor (G) allele of rs181704186 is associated with lower CRP levels and decreased transcriptional activity and protein binding in vitro, providing a plausible molecular mechanism for this African ancestry-specific signal. The individuals homozygous for rs181704186-G have a mean CRP level of 0.23 mg/L, in contrast to individuals heterozygous for rs181704186 with mean CRP of 2.97 mg/L and major allele homozygotes with mean CRP of 4.11 mg/L. This study demonstrates the utility of WGS in multi-ethnic populations to drive discovery of complex trait associations of large effect and to identify functional alleles in noncoding regulatory regions.
Asunto(s)
Pueblo Asiatico/genética , Población Negra/genética , Proteína C-Reactiva/genética , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Población Blanca/genética , Secuenciación Completa del Genoma/métodos , Estudios de Cohortes , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Humanos , Desequilibrio de LigamientoRESUMEN
Mutation is the ultimate source of all genetic novelty and the cause of heritable genetic disorders. Mutational burden has been linked to complex disease, including neurodevelopmental disorders such as schizophrenia and autism. The rate of mutation is a fundamental genomic parameter and direct estimates of this parameter have been enabled by accurate comparisons of whole-genome sequences between parents and offspring. Studies in humans have revealed that the paternal age at conception explains most of the variation in mutation rate: Each additional year of paternal age in humans leads to approximately 1.5 additional inherited mutations. Here, we present an estimate of the de novo mutation rate in the rhesus macaque (Macaca mulatta) using whole-genome sequence data from 32 individuals in four large pedigrees. We estimated an average mutation rate of 0.58 × 10-8 per base pair per generation (at an average parental age of 7.5 yr), much lower than found in direct estimates from great apes. As in humans, older macaque fathers transmit more mutations to their offspring, increasing the per generation mutation rate by 4.27 × 10-10 per base pair per year. We found that the rate of mutation accumulation after puberty is similar between macaques and humans, but that a smaller number of mutations accumulate before puberty in macaques. We additionally investigated the role of paternal age on offspring sociability, a proxy for normal neurodevelopment, by studying 203 male macaques in large social groups.
Asunto(s)
Conducta Animal , Mutación de Línea Germinal , Acumulación de Mutaciones , Edad Paterna , Efectos Tardíos de la Exposición Prenatal/genética , Habilidades Sociales , Factores de Edad , Animales , Femenino , Humanos , Macaca mulatta , Masculino , Tasa de Mutación , Embarazo , Especificidad de la EspecieRESUMEN
Studies of Y Chromosome evolution have focused primarily on gene decay, a consequence of suppression of crossing-over with the X Chromosome. Here, we provide evidence that suppression of X-Y crossing-over unleashed a second dynamic: selfish X-Y arms races that reshaped the sex chromosomes in mammals as different as cattle, mice, and men. Using super-resolution sequencing, we explore the Y Chromosome of Bos taurus (bull) and find it to be dominated by massive, lineage-specific amplification of testis-expressed gene families, making it the most gene-dense Y Chromosome sequenced to date. As in mice, an X-linked homolog of a bull Y-amplified gene has become testis-specific and amplified. This evolutionary convergence implies that lineage-specific X-Y coevolution through gene amplification, and the selfish forces underlying this phenomenon, were dominatingly powerful among diverse mammalian lineages. Together with Y gene decay, X-Y arms races molded mammalian sex chromosomes and influenced the course of mammalian evolution.