Results 1 - 20 of 66
1.
Cell ; 185(18): 3426-3440.e19, 2022 09 01.
Article in English | MEDLINE | ID: mdl-36055201

ABSTRACT

The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-genome sequencing (WGS) data consented for public distribution without access or use restrictions. The final, phase 3 release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low-coverage WGS. Here, we present a high-coverage 3,202-sample WGS 1kGP resource, which now includes 602 complete trios, sequenced to a depth of 30X using Illumina. We performed single-nucleotide variant (SNV) and short insertion and deletion (INDEL) discovery and generated a comprehensive set of structural variants (SVs) by integrating multiple analytic methods through a machine learning model. We show gains in sensitivity and precision of variant calls compared to phase 3, especially among rare SNVs as well as INDELs and SVs spanning the frequency spectrum. We also generated an improved reference imputation panel, making variants discovered here accessible for association studies.


Subject(s)
Genome, Human , Whole Genome Sequencing , Female , High-Throughput Nucleotide Sequencing/methods , Humans , INDEL Mutation , Male , Polymorphism, Single Nucleotide
2.
Cell ; 185(16): 3041-3055.e25, 2022 08 04.
Article in English | MEDLINE | ID: mdl-35917817

ABSTRACT

Rare copy-number variants (rCNVs) include deletions and duplications that occur infrequently in the global human population and can confer substantial risk for disease. In this study, we aimed to quantify the properties of haploinsufficiency (i.e., deletion intolerance) and triplosensitivity (i.e., duplication intolerance) throughout the human genome. We harmonized and meta-analyzed rCNVs from nearly one million individuals to construct a genome-wide catalog of dosage sensitivity across 54 disorders, which defined 163 dosage sensitive segments associated with at least one disorder. These segments were typically gene dense and often harbored dominant dosage sensitive driver genes, which we were able to prioritize using statistical fine-mapping. Finally, we designed an ensemble machine-learning model to predict probabilities of dosage sensitivity (pHaplo & pTriplo) for all autosomal genes, which identified 2,987 haploinsufficient and 1,559 triplosensitive genes, including 648 that were uniquely triplosensitive. This dosage sensitivity resource will provide broad utility for human disease research and clinical genetics.


Subject(s)
DNA Copy Number Variations , Genome, Human , DNA Copy Number Variations/genetics , Gene Dosage , Haploinsufficiency/genetics , Humans
3.
Cell ; 180(3): 568-584.e23, 2020 02 06.
Article in English | MEDLINE | ID: mdl-31981491

ABSTRACT

We present the largest exome sequencing study of autism spectrum disorder (ASD) to date (n = 35,584 total samples, 11,986 with ASD). Using an enhanced analytical framework to integrate de novo and case-control rare variation, we identify 102 risk genes at a false discovery rate of 0.1 or less. Of these genes, 49 show higher frequencies of disruptive de novo variants in individuals ascertained to have severe neurodevelopmental delay, whereas 53 show higher frequencies in individuals ascertained to have ASD; comparing ASD cases with mutations in these groups reveals phenotypic differences. Expressed early in brain development, most risk genes have roles in regulation of gene expression or neuronal communication (i.e., mutations effect neurodevelopmental and neurophysiological changes), and 13 fall within loci recurrently hit by copy number variants. In cells from the human cortex, expression of risk genes is enriched in excitatory and inhibitory neuronal lineages, consistent with multiple paths to an excitatory-inhibitory imbalance underlying ASD.


Subject(s)
Autistic Disorder/genetics , Cerebral Cortex/growth & development , Exome Sequencing/methods , Gene Expression Regulation, Developmental , Neurobiology/methods , Case-Control Studies , Cell Lineage , Cohort Studies , Exome , Female , Gene Frequency , Genetic Predisposition to Disease , Humans , Male , Mutation, Missense , Neurons/metabolism , Phenotype , Sex Factors , Single-Cell Analysis/methods
4.
Cell ; 172(5): 897-909.e21, 2018 02 22.
Article in English | MEDLINE | ID: mdl-29474918

ABSTRACT

X-linked Dystonia-Parkinsonism (XDP) is a Mendelian neurodegenerative disease that is endemic to the Philippines and is associated with a founder haplotype. We integrated multiple genome and transcriptome assembly technologies to narrow the causal mutation to the TAF1 locus, which included a SINE-VNTR-Alu (SVA) retrotransposition into intron 32 of the gene. Transcriptome analyses identified decreased expression of the canonical cTAF1 transcript among XDP probands, and de novo assembly across multiple pluripotent stem-cell-derived neuronal lineages discovered aberrant TAF1 transcription that involved alternative splicing and intron retention (IR) in proximity to the SVA that was anti-correlated with overall TAF1 expression. CRISPR/Cas9 excision of the SVA rescued this XDP-specific transcriptional signature and normalized TAF1 expression in probands. These data suggest an SVA-mediated aberrant transcriptional mechanism associated with XDP and may provide a roadmap for layered technologies and integrated assembly-based analyses for other unsolved Mendelian disorders.


Subject(s)
Dystonic Disorders/genetics , Genetic Diseases, X-Linked/genetics , Genome, Human , Transcriptome/genetics , Alternative Splicing/genetics , Alu Elements/genetics , Base Sequence , CRISPR-Cas Systems/genetics , Cohort Studies , Family , Female , Genetic Loci , Haplotypes/genetics , High-Throughput Nucleotide Sequencing , Histone Acetyltransferases/genetics , Histone Acetyltransferases/metabolism , Humans , Induced Pluripotent Stem Cells/metabolism , Introns/genetics , Male , Minisatellite Repeats/genetics , Models, Genetic , Nerve Degeneration/genetics , Nerve Degeneration/pathology , Neural Stem Cells/metabolism , Neurons/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Short Interspersed Nucleotide Elements , TATA-Binding Protein Associated Factors/genetics , TATA-Binding Protein Associated Factors/metabolism , Transcription Factor TFIID/genetics , Transcription Factor TFIID/metabolism
5.
Genome Res ; 34(5): 796-809, 2024 06 25.
Article in English | MEDLINE | ID: mdl-38749656

ABSTRACT

Underrepresented populations are often excluded from genomic studies owing in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high-quality set of 4094 whole genomes from 80 populations in the HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also show substantial added value from this data set compared with the prior versions of the component resources, typically combined via liftOver and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared with previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality-control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.


Subject(s)
Databases, Genetic , Genome, Human , Humans , Human Genome Project , High-Throughput Nucleotide Sequencing/methods , Genetic Variation , Genomics/methods
6.
Am J Hum Genet ; 110(10): 1787-1803, 2023 10 05.
Article in English | MEDLINE | ID: mdl-37751738

ABSTRACT

Congenital diaphragmatic hernia (CDH) is a relatively common and genetically heterogeneous structural birth defect associated with high mortality and morbidity. We describe eight unrelated families with an X-linked condition characterized by diaphragm defects, variable anterior body-wall anomalies, and/or facial dysmorphism. Using linkage analysis and exome or genome sequencing, we found that missense variants in plastin 3 (PLS3), a gene encoding an actin bundling protein, co-segregate with disease in all families. Loss-of-function variants in PLS3 have been previously associated with X-linked osteoporosis (MIM: 300910), so we used in silico protein modeling and a mouse model to address these seemingly disparate clinical phenotypes. The missense variants in individuals with CDH are located within the actin-binding domains of the protein but are not predicted to affect protein structure, whereas the variants in individuals with osteoporosis are predicted to result in loss of function. A mouse knockin model of a variant identified in one of the CDH-affected families, c.1497G>C (p.Trp499Cys), shows partial perinatal lethality and recapitulates the key findings of the human phenotype, including diaphragm and abdominal-wall defects. Both the mouse model and one adult human male with a CDH-associated PLS3 variant were observed to have increased rather than decreased bone mineral density. Together, these clinical and functional data in humans and mice reveal that specific missense variants affecting the actin-binding domains of PLS3 might have a gain-of-function effect and cause a Mendelian congenital disorder.


Subject(s)
Hernias, Diaphragmatic, Congenital , Osteoporosis , Adult , Humans , Male , Animals , Mice , Hernias, Diaphragmatic, Congenital/genetics , Actins/genetics , Mutation, Missense/genetics , Osteoporosis/genetics
7.
Am J Hum Genet ; 110(9): 1454-1469, 2023 09 07.
Article in English | MEDLINE | ID: mdl-37595579

ABSTRACT

Short-read genome sequencing (GS) holds the promise of becoming the primary diagnostic approach for the assessment of autism spectrum disorder (ASD) and fetal structural anomalies (FSAs). However, few studies have comprehensively evaluated its performance against current standard-of-care diagnostic tests: karyotype, chromosomal microarray (CMA), and exome sequencing (ES). To assess the clinical utility of GS, we compared its diagnostic yield against these three tests in 1,612 quartet families including an individual with ASD and in 295 prenatal families. Our GS analytic framework identified a diagnostic variant in 7.8% of ASD probands, almost 2-fold more than CMA (4.3%) and 3-fold more than ES (2.7%). However, when we systematically captured copy-number variants (CNVs) from the exome data, the diagnostic yield of ES (7.4%) was brought much closer to, but did not surpass, GS. Similarly, we estimated that GS could achieve an overall diagnostic yield of 46.1% in unselected FSAs, representing a 17.2% increased yield over karyotype, 14.1% over CMA, and 4.1% over ES with CNV calling or 36.1% increase without CNV discovery. Overall, GS provided an added diagnostic yield of 0.4% and 0.8% beyond the combination of all three standard-of-care tests in ASD and FSAs, respectively. This corresponded to nine GS unique diagnostic variants, including sequence variants in exons not captured by ES, structural variants (SVs) inaccessible to existing standard-of-care tests, and SVs where the resolution of GS changed variant classification. Overall, this large-scale evaluation demonstrated that GS significantly outperforms each individual standard-of-care test while also outperforming the combination of all three tests, thus warranting consideration as the first-tier diagnostic approach for the assessment of ASD and FSAs.


Subject(s)
Autism Spectrum Disorder , Female , Pregnancy , Humans , Autism Spectrum Disorder/diagnosis , Autism Spectrum Disorder/genetics , Pregnancy Trimester, First , Ultrasonography, Prenatal , Chromosome Mapping , Exome
8.
Nature ; 581(7809): 444-451, 2020 05.
Article in English | MEDLINE | ID: mdl-32461652

ABSTRACT

Structural variants (SVs) rearrange large segments of DNA and can have profound consequences in evolution and human disease. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD) have become integral in the interpretation of single-nucleotide variants (SNVs). However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings. This SV resource is freely distributed via the gnomAD browser and will have broad utility in population genetics, disease-association studies, and diagnostic screening.


Subject(s)
Disease/genetics , Genetic Variation , Genetics, Medical/standards , Genetics, Population/standards , Genome, Human/genetics , Female , Genetic Testing , Genotyping Techniques , Humans , Male , Middle Aged , Mutation , Polymorphism, Single Nucleotide/genetics , Racial Groups/genetics , Reference Standards , Selection, Genetic , Whole Genome Sequencing
9.
Nature ; 581(7809): 434-443, 2020 05.
Article in English | MEDLINE | ID: mdl-32461654

ABSTRACT

Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.


Subject(s)
Exome/genetics , Genes, Essential/genetics , Genetic Variation/genetics , Genome, Human/genetics , Adult , Brain/metabolism , Cardiovascular Diseases/genetics , Cohort Studies , Databases, Genetic , Female , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study , Humans , Loss of Function Mutation/genetics , Male , Mutation Rate , Proprotein Convertase 9/genetics , RNA, Messenger/genetics , Reproducibility of Results , Exome Sequencing , Whole Genome Sequencing
10.
Hum Genet ; 2024 Oct 03.
Article in English | MEDLINE | ID: mdl-39361040

ABSTRACT

Structural birth defects affect 3-4% of all live births and, depending on the type, tend to manifest in a sex-biased manner. Orofacial clefts (OFCs) are the most common craniofacial structural birth defects and are often divided into cleft lip with or without cleft palate (CL/P) and cleft palate only (CP). Previous studies have found sex-specific risks for CL/P, but these risks have yet to be evaluated in CP. CL/P is more common in males and CP is more frequently observed in females, so we hypothesized there would also be sex-specific differences for CP. Using a trio-based cohort, we performed sex-stratified genome-wide association studies (GWAS) based on proband sex followed by genome-wide gene-by-sex (G × S) interaction testing. There were 13 loci significant for G × S interactions, with the top finding in LTBP1 (RR = 3.37 [2.04-5.56], p = 1.93 × 10⁻⁶). LTBP1 plays a role in regulating TGF-β bioavailability, and knockdown in both mice and zebrafish leads to craniofacial anomalies. Further, there is evidence for differential expression of LTBP1 between males and females in both mice and humans. Therefore, we tested the association between the imputed genetically regulated gene expression of genes with significant G × S interactions and the CP phenotype. We found significant association for LTBP1 in cell cultured fibroblasts in female probands (p = 0.0013) but not in males. Taken together, we show there are sex-specific risks for CP that are otherwise undetectable in a combined sex cohort, and LTBP1 is a candidate risk gene, particularly in females.
