Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 137
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 185(18): 3426-3440.e19, 2022 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-36055201

RESUMEN

The 1000 Genomes Project (1kGP) is the largest fully open resource of whole-genome sequencing (WGS) data consented for public distribution without access or use restrictions. The final, phase 3 release of the 1kGP included 2,504 unrelated samples from 26 populations and was based primarily on low-coverage WGS. Here, we present a high-coverage 3,202-sample WGS 1kGP resource, which now includes 602 complete trios, sequenced to a depth of 30X using Illumina. We performed single-nucleotide variant (SNV) and short insertion and deletion (INDEL) discovery and generated a comprehensive set of structural variants (SVs) by integrating multiple analytic methods through a machine learning model. We show gains in sensitivity and precision of variant calls compared to phase 3, especially among rare SNVs as well as INDELs and SVs spanning frequency spectrum. We also generated an improved reference imputation panel, making variants discovered here accessible for association studies.


Asunto(s)
Genoma Humano , Secuenciación Completa del Genoma , Femenino , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Mutación INDEL , Masculino , Polimorfismo de Nucleótido Simple
2.
Cell ; 185(16): 3041-3055.e25, 2022 08 04.
Artículo en Inglés | MEDLINE | ID: mdl-35917817

RESUMEN

Rare copy-number variants (rCNVs) include deletions and duplications that occur infrequently in the global human population and can confer substantial risk for disease. In this study, we aimed to quantify the properties of haploinsufficiency (i.e., deletion intolerance) and triplosensitivity (i.e., duplication intolerance) throughout the human genome. We harmonized and meta-analyzed rCNVs from nearly one million individuals to construct a genome-wide catalog of dosage sensitivity across 54 disorders, which defined 163 dosage sensitive segments associated with at least one disorder. These segments were typically gene dense and often harbored dominant dosage sensitive driver genes, which we were able to prioritize using statistical fine-mapping. Finally, we designed an ensemble machine-learning model to predict probabilities of dosage sensitivity (pHaplo & pTriplo) for all autosomal genes, which identified 2,987 haploinsufficient and 1,559 triplosensitive genes, including 648 that were uniquely triplosensitive. This dosage sensitivity resource will provide broad utility for human disease research and clinical genetics.


Asunto(s)
Variaciones en el Número de Copia de ADN , Genoma Humano , Variaciones en el Número de Copia de ADN/genética , Dosificación de Gen , Haploinsuficiencia/genética , Humanos
3.
Cell ; 180(3): 568-584.e23, 2020 02 06.
Artículo en Inglés | MEDLINE | ID: mdl-31981491

RESUMEN

We present the largest exome sequencing study of autism spectrum disorder (ASD) to date (n = 35,584 total samples, 11,986 with ASD). Using an enhanced analytical framework to integrate de novo and case-control rare variation, we identify 102 risk genes at a false discovery rate of 0.1 or less. Of these genes, 49 show higher frequencies of disruptive de novo variants in individuals ascertained to have severe neurodevelopmental delay, whereas 53 show higher frequencies in individuals ascertained to have ASD; comparing ASD cases with mutations in these groups reveals phenotypic differences. Expressed early in brain development, most risk genes have roles in regulation of gene expression or neuronal communication (i.e., mutations effect neurodevelopmental and neurophysiological changes), and 13 fall within loci recurrently hit by copy number variants. In cells from the human cortex, expression of risk genes is enriched in excitatory and inhibitory neuronal lineages, consistent with multiple paths to an excitatory-inhibitory imbalance underlying ASD.


Asunto(s)
Trastorno Autístico/genética , Corteza Cerebral/crecimiento & desarrollo , Secuenciación del Exoma/métodos , Regulación del Desarrollo de la Expresión Génica , Neurobiología/métodos , Estudios de Casos y Controles , Linaje de la Célula , Estudios de Cohortes , Exoma , Femenino , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Humanos , Masculino , Mutación Missense , Neuronas/metabolismo , Fenotipo , Factores Sexuales , Análisis de la Célula Individual/métodos
4.
Cell ; 172(5): 897-909.e21, 2018 02 22.
Artículo en Inglés | MEDLINE | ID: mdl-29474918

RESUMEN

X-linked Dystonia-Parkinsonism (XDP) is a Mendelian neurodegenerative disease that is endemic to the Philippines and is associated with a founder haplotype. We integrated multiple genome and transcriptome assembly technologies to narrow the causal mutation to the TAF1 locus, which included a SINE-VNTR-Alu (SVA) retrotransposition into intron 32 of the gene. Transcriptome analyses identified decreased expression of the canonical cTAF1 transcript among XDP probands, and de novo assembly across multiple pluripotent stem-cell-derived neuronal lineages discovered aberrant TAF1 transcription that involved alternative splicing and intron retention (IR) in proximity to the SVA that was anti-correlated with overall TAF1 expression. CRISPR/Cas9 excision of the SVA rescued this XDP-specific transcriptional signature and normalized TAF1 expression in probands. These data suggest an SVA-mediated aberrant transcriptional mechanism associated with XDP and may provide a roadmap for layered technologies and integrated assembly-based analyses for other unsolved Mendelian disorders.


Asunto(s)
Trastornos Distónicos/genética , Enfermedades Genéticas Ligadas al Cromosoma X/genética , Genoma Humano , Transcriptoma/genética , Empalme Alternativo/genética , Elementos Alu/genética , Secuencia de Bases , Sistemas CRISPR-Cas/genética , Estudios de Cohortes , Familia , Femenino , Sitios Genéticos , Haplotipos/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Histona Acetiltransferasas/genética , Histona Acetiltransferasas/metabolismo , Humanos , Células Madre Pluripotentes Inducidas/metabolismo , Intrones/genética , Masculino , Repeticiones de Minisatélite/genética , Modelos Genéticos , Degeneración Nerviosa/genética , Degeneración Nerviosa/patología , Células-Madre Neurales/metabolismo , Neuronas/metabolismo , ARN Mensajero/genética , ARN Mensajero/metabolismo , Elementos de Nucleótido Esparcido Corto , Factores Asociados con la Proteína de Unión a TATA/genética , Factores Asociados con la Proteína de Unión a TATA/metabolismo , Factor de Transcripción TFIID/genética , Factor de Transcripción TFIID/metabolismo
5.
Cell ; 167(7): 1705-1718.e13, 2016 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-27984722

RESUMEN

Metformin has utility in cancer prevention and treatment, though the mechanisms for these effects remain elusive. Through genetic screening in C. elegans, we uncover two metformin response elements: the nuclear pore complex (NPC) and acyl-CoA dehydrogenase family member-10 (ACAD10). We demonstrate that biguanides inhibit growth by inhibiting mitochondrial respiratory capacity, which restrains transit of the RagA-RagC GTPase heterodimer through the NPC. Nuclear exclusion renders RagC incapable of gaining the GDP-bound state necessary to stimulate mTORC1. Biguanide-induced inactivation of mTORC1 subsequently inhibits growth through transcriptional induction of ACAD10. This ancient metformin response pathway is conserved from worms to humans. Both restricted nuclear pore transit and upregulation of ACAD10 are required for biguanides to reduce viability in melanoma and pancreatic cancer cells, and to extend C. elegans lifespan. This pathway provides a unified mechanism by which metformin kills cancer cells and extends lifespan, and illuminates potential cancer targets. PAPERCLIP.


Asunto(s)
Metformina/farmacología , Acil-CoA Deshidrogenasa/genética , Envejecimiento , Animales , Tamaño Corporal , Caenorhabditis elegans/crecimiento & desarrollo , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/metabolismo , Línea Celular Tumoral , Proteínas de Unión al ADN/metabolismo , Humanos , Longevidad , Diana Mecanicista del Complejo 1 de la Rapamicina , Mitocondrias/metabolismo , Proteínas de Unión al GTP Monoméricas/metabolismo , Complejos Multiproteicos/metabolismo , Neoplasias/tratamiento farmacológico , Poro Nuclear/metabolismo , Fosforilación Oxidativa , Transducción de Señal/efectos de los fármacos , Serina-Treonina Quinasas TOR/metabolismo , Factores de Transcripción/metabolismo
6.
Nature ; 625(7993): 92-100, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38057664

RESUMEN

The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders1-4, but attempts to assess constraint for non-protein-coding regions have proved more difficult. Here we aggregate, process and release a dataset of 76,156 human genomes from the Genome Aggregation Database (gnomAD)-the largest public open-access human genome allele frequency reference dataset-and use it to build a genomic constraint map for the whole genome (genomic non-coding constraint of haploinsufficient variation (Gnocchi)). We present a refined mutational model that incorporates local sequence context and regional genomic features to detect depletions of variation. As expected, the average constraint for protein-coding sequences is stronger than that for non-coding regions. Within the non-coding genome, constrained regions are enriched for known regulatory elements and variants that are implicated in complex human diseases and traits, facilitating the triangulation of biological annotation, disease association and natural selection to non-coding DNA analysis. More constrained regulatory elements tend to regulate more constrained protein-coding genes, which in turn suggests that non-coding constraint can aid the identification of constrained genes that are as yet unrecognized by current gene constraint metrics. We demonstrate that this genome-wide constraint map improves the identification and interpretation of functional human genetic variation.


Asunto(s)
Genoma Humano , Genómica , Modelos Genéticos , Mutación , Humanos , Acceso a la Información , Bases de Datos Genéticas , Conjuntos de Datos como Asunto , Frecuencia de los Genes , Genoma Humano/genética , Mutación/genética , Selección Genética
7.
Genome Res ; 34(5): 796-809, 2024 Jun 25.
Artículo en Inglés | MEDLINE | ID: mdl-38749656

RESUMEN

Underrepresented populations are often excluded from genomic studies owing in part to a lack of resources supporting their analyses. The 1000 Genomes Project (1kGP) and Human Genome Diversity Project (HGDP), which have recently been sequenced to high coverage, are valuable genomic resources because of the global diversity they capture and their open data sharing policies. Here, we harmonized a high-quality set of 4094 whole genomes from 80 populations in the HGDP and 1kGP with data from the Genome Aggregation Database (gnomAD) and identified over 153 million high-quality SNVs, indels, and SVs. We performed a detailed ancestry analysis of this cohort, characterizing population structure and patterns of admixture across populations, analyzing site frequency spectra, and measuring variant counts at global and subcontinental levels. We also show substantial added value from this data set compared with the prior versions of the component resources, typically combined via liftOver and variant intersection; for example, we catalog millions of new genetic variants, mostly rare, compared with previous releases. In addition to unrestricted individual-level public release, we provide detailed tutorials for conducting many of the most common quality-control steps and analyses with these data in a scalable cloud-computing environment and publicly release this new phased joint callset for use as a haplotype resource in phasing and imputation pipelines. This jointly called reference panel will serve as a key resource to support research of diverse ancestry populations.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Humanos , Proyecto Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Variación Genética , Genómica/métodos
8.
Cell ; 149(3): 525-37, 2012 Apr 27.
Artículo en Inglés | MEDLINE | ID: mdl-22521361

RESUMEN

Balanced chromosomal abnormalities (BCAs) represent a relatively untapped reservoir of single-gene disruptions in neurodevelopmental disorders (NDDs). We sequenced BCAs in patients with autism or related NDDs, revealing disruption of 33 loci in four general categories: (1) genes previously associated with abnormal neurodevelopment (e.g., AUTS2, FOXP1, and CDKL5), (2) single-gene contributors to microdeletion syndromes (MBD5, SATB2, EHMT1, and SNURF-SNRPN), (3) novel risk loci (e.g., CHD8, KIRREL3, and ZNF507), and (4) genes associated with later-onset psychiatric disorders (e.g., TCF4, ZNF804A, PDE10A, GRIN2B, and ANK3). We also discovered among neurodevelopmental cases a profoundly increased burden of copy-number variants from these 33 loci and a significant enrichment of polygenic risk alleles from genome-wide association studies of autism and schizophrenia. Our findings suggest a polygenic risk model of autism and reveal that some neurodevelopmental genes are sensitive to perturbation by multiple mutational mechanisms, leading to variable phenotypic outcomes that manifest at different life stages.


Asunto(s)
Trastornos Generalizados del Desarrollo Infantil/genética , Aberraciones Cromosómicas , Trastorno Autístico/diagnóstico , Trastorno Autístico/genética , Niño , Trastornos Generalizados del Desarrollo Infantil/diagnóstico , Rotura Cromosómica , Deleción Cromosómica , Variaciones en el Número de Copia de ADN , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Sistema Nervioso/crecimiento & desarrollo , Esquizofrenia/genética , Análisis de Secuencia de ADN , Transducción de Señal
9.
Am J Hum Genet ; 110(9): 1454-1469, 2023 09 07.
Artículo en Inglés | MEDLINE | ID: mdl-37595579

RESUMEN

Short-read genome sequencing (GS) holds the promise of becoming the primary diagnostic approach for the assessment of autism spectrum disorder (ASD) and fetal structural anomalies (FSAs). However, few studies have comprehensively evaluated its performance against current standard-of-care diagnostic tests: karyotype, chromosomal microarray (CMA), and exome sequencing (ES). To assess the clinical utility of GS, we compared its diagnostic yield against these three tests in 1,612 quartet families including an individual with ASD and in 295 prenatal families. Our GS analytic framework identified a diagnostic variant in 7.8% of ASD probands, almost 2-fold more than CMA (4.3%) and 3-fold more than ES (2.7%). However, when we systematically captured copy-number variants (CNVs) from the exome data, the diagnostic yield of ES (7.4%) was brought much closer to, but did not surpass, GS. Similarly, we estimated that GS could achieve an overall diagnostic yield of 46.1% in unselected FSAs, representing a 17.2% increased yield over karyotype, 14.1% over CMA, and 4.1% over ES with CNV calling or 36.1% increase without CNV discovery. Overall, GS provided an added diagnostic yield of 0.4% and 0.8% beyond the combination of all three standard-of-care tests in ASD and FSAs, respectively. This corresponded to nine GS unique diagnostic variants, including sequence variants in exons not captured by ES, structural variants (SVs) inaccessible to existing standard-of-care tests, and SVs where the resolution of GS changed variant classification. Overall, this large-scale evaluation demonstrated that GS significantly outperforms each individual standard-of-care test while also outperforming the combination of all three tests, thus warranting consideration as the first-tier diagnostic approach for the assessment of ASD and FSAs.


Asunto(s)
Trastorno del Espectro Autista , Femenino , Embarazo , Humanos , Trastorno del Espectro Autista/diagnóstico , Trastorno del Espectro Autista/genética , Primer Trimestre del Embarazo , Ultrasonografía Prenatal , Mapeo Cromosómico , Exoma
10.
Nature ; 581(7809): 444-451, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-32461652

RESUMEN

Structural variants (SVs) rearrange large segments of DNA1 and can have profound consequences in evolution and human disease2,3. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD)4 have become integral in the interpretation of single-nucleotide variants (SNVs)5. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage6. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings7. This SV resource is freely distributed via the gnomAD browser8 and will have broad utility in population genetics, disease-association studies, and diagnostic screening.


Asunto(s)
Enfermedad/genética , Variación Genética , Genética Médica/normas , Genética de Población/normas , Genoma Humano/genética , Femenino , Pruebas Genéticas , Técnicas de Genotipaje , Humanos , Masculino , Persona de Mediana Edad , Mutación , Polimorfismo de Nucleótido Simple/genética , Grupos Raciales/genética , Estándares de Referencia , Selección Genética , Secuenciación Completa del Genoma
11.
Am J Hum Genet ; 109(11): 2049-2067, 2022 11 03.
Artículo en Inglés | MEDLINE | ID: mdl-36283406

RESUMEN

Point mutations and structural variants that directly disrupt the coding sequence of MEF2C have been associated with a spectrum of neurodevelopmental disorders (NDDs). However, the impact of MEF2C haploinsufficiency on neurodevelopmental pathways and synaptic processes is not well understood, nor are the complex mechanisms that govern its regulation. To explore the functional changes associated with structural variants that alter MEF2C expression and/or regulation, we generated an allelic series of 204 isogenic human induced pluripotent stem cell (hiPSC)-derived neural stem cells and glutamatergic induced neurons. These neuronal models harbored CRISPR-engineered mutations that involved direct deletion of MEF2C or deletion of the boundary points for topologically associating domains (TADs) and chromatin loops encompassing MEF2C. Systematic profiling of mutation-specific alterations, contrasted to unedited controls that were exposed to the same guide RNAs for each edit, revealed that deletion of MEF2C caused differential expression of genes associated with neurodevelopmental pathways and synaptic function. We also discovered significant reduction in synaptic activity measured by multielectrode arrays (MEAs) in neuronal cells. By contrast, we observed robust buffering against MEF2C regulatory disruption following deletion of a distal 5q14.3 TAD and loop boundary, whereas homozygous loss of a proximal loop boundary resulted in down-regulation of MEF2C expression and reduced electrophysiological activity on MEA that was comparable to direct gene disruption. Collectively, these studies highlight the considerable functional impact of MEF2C deletion in neuronal cells and systematically characterize the complex interactions that challenge a priori predictions of regulatory consequences from structural variants that disrupt three-dimensional genome organization.


Asunto(s)
Células Madre Pluripotentes Inducidas , Células-Madre Neurales , Humanos , Genoma , Haploinsuficiencia , Factores de Transcripción MEF2/genética , Neuronas , Transcripción Genética
12.
Am J Hum Genet ; 109(10): 1789-1813, 2022 10 06.
Artículo en Inglés | MEDLINE | ID: mdl-36152629

RESUMEN

Chromosome 16p11.2 reciprocal genomic disorder, resulting from recurrent copy-number variants (CNVs), involves intellectual disability, autism spectrum disorder (ASD), and schizophrenia, but the responsible mechanisms are not known. To systemically dissect molecular effects, we performed transcriptome profiling of 350 libraries from six tissues (cortex, cerebellum, striatum, liver, brown fat, and white fat) in mouse models harboring CNVs of the syntenic 7qF3 region, as well as cellular, transcriptional, and single-cell analyses in 54 isogenic neural stem cell, induced neuron, and cerebral organoid models of CRISPR-engineered 16p11.2 CNVs. Transcriptome-wide differentially expressed genes were largely tissue-, cell-type-, and dosage-specific, although more effects were shared between deletion and duplication and across tissue than expected by chance. The broadest effects were observed in the cerebellum (2,163 differentially expressed genes), and the greatest enrichments were associated with synaptic pathways in mouse cerebellum and human induced neurons. Pathway and co-expression analyses identified energy and RNA metabolism as shared processes and enrichment for ASD-associated, loss-of-function constraint, and fragile X messenger ribonucleoprotein target gene sets. Intriguingly, reciprocal 16p11.2 dosage changes resulted in consistent decrements in neurite and electrophysiological features, and single-cell profiling of organoids showed reciprocal alterations to the proportions of excitatory and inhibitory GABAergic neurons. Changes both in neuronal ratios and in gene expression in our organoid analyses point most directly to calretinin GABAergic inhibitory neurons and the excitatory/inhibitory balance as targets of disruption that might contribute to changes in neurodevelopmental and cognitive function in 16p11.2 carriers. Collectively, our data indicate the genomic disorder involves disruption of multiple contributing biological processes and that this disruption has relative impacts that are context specific.


Asunto(s)
Trastorno del Espectro Autista , Trastornos de los Cromosomas , Discapacidad Intelectual , Animales , Trastorno del Espectro Autista/genética , Calbindina 2/genética , Corteza Cerebral , Deleción Cromosómica , Trastornos de los Cromosomas/genética , Cromosomas Humanos Par 16/genética , Variaciones en el Número de Copia de ADN , Genómica , Humanos , Discapacidad Intelectual/genética , Ratones , Neuronas , ARN
14.
Am J Hum Genet ; 108(11): 2145-2158, 2021 11 04.
Artículo en Inglés | MEDLINE | ID: mdl-34672987

RESUMEN

Dystonia is a neurologic disorder associated with an increasingly large number of genetic variants in many genes, resulting in characteristic disturbances in volitional movement. Dissecting the relationships between these mutations and their functional outcomes is critical in understanding the pathways that drive dystonia pathogenesis. Here we established a pipeline for characterizing an allelic series of dystonia-specific mutations. We used this strategy to investigate the molecular consequences of genetic variation in THAP1, which encodes a transcription factor linked to neural differentiation. Multiple pathogenic mutations associated with dystonia cluster within distinct THAP1 functional domains and are predicted to alter DNA-binding properties and/or protein interactions differently, yet the relative impact of these varied changes on molecular signatures and neural deficits is unclear. To determine the effects of these mutations on THAP1 transcriptional activity, we engineered an allelic series of eight alterations in a common induced pluripotent stem cell background and differentiated these lines into a panel of near-isogenic neural stem cells (n = 94 lines). Transcriptome profiling followed by joint analysis of the most robust signatures across mutations identified a convergent pattern of dysregulated genes functionally related to neurodevelopment, lysosomal lipid metabolism, and myelin. On the basis of these observations, we examined mice bearing Thap1-disruptive alleles and detected significant changes in myelin gene expression and reduction of myelin structural integrity relative to control mice. These results suggest that deficits in neurodevelopment and myelination are common consequences of dystonia-associated THAP1 mutations and highlight the potential role of neuron-glial interactions in the pathogenesis of dystonia.


Asunto(s)
Proteínas Reguladoras de la Apoptosis/genética , Proteínas de Unión al ADN/genética , Distonía/genética , Trastornos Distónicos/genética , Mutación , Vaina de Mielina/genética , Alelos , Animales , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Humanos , Ratones
15.
Am J Hum Genet ; 108(4): 597-607, 2021 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-33675682

RESUMEN

Each human genome includes de novo mutations that arose during gametogenesis. While these germline mutations represent a fundamental source of new genetic diversity, they can also create deleterious alleles that impact fitness. Whereas the rate and patterns of point mutations in the human germline are now well understood, far less is known about the frequency and features that impact de novo structural variants (dnSVs). We report a family-based study of germline mutations among 9,599 human genomes from 33 multigenerational CEPH-Utah families and 2,384 families from the Simons Foundation Autism Research Initiative. We find that de novo structural mutations detected by alignment-based, short-read WGS occur at an overall rate of at least 0.160 events per genome in unaffected individuals, and we observe a significantly higher rate (0.206 per genome) in ASD-affected individuals. In both probands and unaffected samples, nearly 73% of de novo structural mutations arose in paternal gametes, and we predict most de novo structural mutations to be caused by mutational mechanisms that do not require sequence homology. After multiple testing correction, we did not observe a statistically significant correlation between parental age and the rate of de novo structural variation in offspring. These results highlight that a spectrum of mutational mechanisms contribute to germline structural mutations and that these mechanisms most likely have markedly different rates and selective pressures than those leading to point mutations.


Asunto(s)
Familia , Genoma Humano/genética , Células Germinativas , Mutación de Línea Germinal/genética , Tasa de Mutación , Envejecimiento/genética , Trastorno Autístico/genética , Sesgo , Variaciones en el Número de Copia de ADN/genética , Análisis Mutacional de ADN , Femenino , Humanos , Masculino , Edad Paterna , Mutación Puntual/genética
16.
Am J Hum Genet ; 108(5): 919-928, 2021 05 06.
Artículo en Inglés | MEDLINE | ID: mdl-33789087

RESUMEN

Virtually all genome sequencing efforts in national biobanks, complex and Mendelian disease programs, and medical genetic initiatives are reliant upon short-read whole-genome sequencing (srWGS), which presents challenges for the detection of structural variants (SVs) relative to emerging long-read WGS (lrWGS) technologies. Given this ubiquity of srWGS in large-scale genomics initiatives, we sought to establish expectations for routine SV detection from this data type by comparison with lrWGS assembly, as well as to quantify the genomic properties and added value of SVs uniquely accessible to each technology. Analyses from the Human Genome Structural Variation Consortium (HGSVC) of three families captured ~11,000 SVs per genome from srWGS and ~25,000 SVs per genome from lrWGS assembly. Detection power and precision for SV discovery varied dramatically by genomic context and variant class: 9.7% of the current GRCh38 reference is defined by segmental duplication (SD) and simple repeat (SR), yet 91.4% of deletions that were specifically discovered by lrWGS localized to these regions. Across the remaining 90.3% of reference sequence, we observed extremely high (93.8%) concordance between technologies for deletions in these datasets. In contrast, lrWGS was superior for detection of insertions across all genomic contexts. Given that non-SD/SR sequences encompass 95.9% of currently annotated disease-associated exons, improved sensitivity from lrWGS to discover novel pathogenic deletions in these currently interpretable genomic regions is likely to be incremental. However, these analyses highlight the considerable added value of assembly-based lrWGS to create new catalogs of insertions and transposable elements, as well as disease-associated repeat expansions in genomic sequences that were previously recalcitrant to routine assessment.


Asunto(s)
Genoma Humano/genética , Variación Estructural del Genoma , Genómica/métodos , Objetivos , Secuenciación Completa del Genoma/métodos , Secuenciación Completa del Genoma/normas , Variaciones en el Número de Copia de ADN , Exones/genética , Humanos , Proyectos de Investigación , Duplicaciones Segmentarias en el Genoma , Alineación de Secuencia
17.
Genet Med ; 26(5): 101076, 2024 05.
Artículo en Inglés | MEDLINE | ID: mdl-38258669

RESUMEN

PURPOSE: Genome sequencing (GS)-specific diagnostic rates in prospective tightly ascertained exome sequencing (ES)-negative intellectual disability (ID) cohorts have not been reported extensively. METHODS: ES, GS, epigenetic signatures, and long-read sequencing diagnoses were assessed in 74 trios with at least moderate ID. RESULTS: The ES diagnostic yield was 42 of 74 (57%). GS diagnoses were made in 9 of 32 (28%) ES-unresolved families. Repeated ES with a contemporary pipeline on the GS-diagnosed families identified 8 of 9 single-nucleotide variations/copy-number variations undetected in older ES, confirming a GS-unique diagnostic rate of 1 in 32 (3%). Episignatures contributed diagnostic information in 9% with GS corroboration in 1 of 32 (3%) and diagnostic clues in 2 of 32 (6%). A genetic etiology for ID was detected in 51 of 74 (69%) families. Twelve candidate disease genes were identified. Contemporary ES followed by GS cost US$4976 (95% CI: $3704; $6969) per diagnosis and first-line GS at a cost of $7062 (95% CI: $6210; $8475) per diagnosis. CONCLUSION: Performing GS only in ID trios would be cost equivalent to ES if GS were available at $2435, about a 60% reduction from current prices. This study demonstrates that first-line GS achieves higher diagnostic rate than contemporary ES but at a higher cost.


Asunto(s)
Secuenciación del Exoma , Exoma , Discapacidad Intelectual , Humanos , Discapacidad Intelectual/genética , Discapacidad Intelectual/diagnóstico , Masculino , Femenino , Exoma/genética , Secuenciación del Exoma/economía , Estudios de Cohortes , Pruebas Genéticas/economía , Pruebas Genéticas/métodos , Secuenciación Completa del Genoma/economía , Niño , Genoma Humano/genética , Variaciones en el Número de Copia de ADN/genética , Polimorfismo de Nucleótido Simple/genética , Preescolar
18.
Ann Neurol ; 93(5): 1012-1022, 2023 05.
Artículo en Inglés | MEDLINE | ID: mdl-36695634

RESUMEN

OBJECTIVE: Identification of genetic risk factors for Parkinson disease (PD) has to date been primarily limited to the study of single nucleotide variants, which only represent a small fraction of the genetic variation in the human genome. Consequently, causal variants for most PD risk are not known. Here we focused on structural variants (SVs), which represent a major source of genetic variation in the human genome. We aimed to discover SVs associated with PD risk by performing the first large-scale characterization of SVs in PD. METHODS: We leveraged a recently developed computational pipeline to detect and genotype SVs from 7,772 Illumina short-read whole genome sequencing samples. Using this set of SV variants, we performed a genome-wide association study using 2,585 cases and 2,779 controls and identified SVs associated with PD risk. Furthermore, to validate the presence of these variants, we generated a subset of matched whole-genome long-read sequencing data. RESULTS: We genotyped and tested 3,154 common SVs, representing over 412 million nucleotides of previously uncatalogued genetic variation. Using long-read sequencing data, we validated the presence of three novel deletion SVs that are associated with risk of PD from our initial association analysis, including a 2 kb intronic deletion within the gene LRRN4. INTERPRETATION: We identified three SVs associated with genetic risk of PD. This study represents the most comprehensive assessment of the contribution of SVs to the genetic risk of PD to date. ANN NEUROL 2023;93:1012-1022.


Asunto(s)
Estudio de Asociación del Genoma Completo , Enfermedad de Parkinson , Humanos , Enfermedad de Parkinson/genética , Genoma Humano , Secuenciación Completa del Genoma , Genotipo
19.
Nature ; 559(7714): 350-355, 2018 07.
Artículo en Inglés | MEDLINE | ID: mdl-29995854

RESUMEN

The selective pressures that shape clonal evolution in healthy individuals are largely unknown. Here we investigate 8,342 mosaic chromosomal alterations, from 50 kb to 249 Mb long, that we uncovered in blood-derived DNA from 151,202 UK Biobank participants using phase-based computational techniques (estimated false discovery rate, 6-9%). We found six loci at which inherited variants associated strongly with the acquisition of deletions or loss of heterozygosity in cis. At three such loci (MPL, TM2D3-TARSL2, and FRA10B), we identified a likely causal variant that acted with high penetrance (5-50%). Inherited alleles at one locus appeared to affect the probability of somatic mutation, and at three other loci to be objects of positive or negative clonal selection. Several specific mosaic chromosomal alterations were strongly associated with future haematological malignancies. Our results reveal a multitude of paths towards clonal expansions with a wide range of effects on human health.


Asunto(s)
Aberraciones Cromosómicas , Células Clonales/citología , Células Clonales/metabolismo , Hematopoyesis/genética , Mosaicismo , Adulto , Anciano , Alelos , Bancos de Muestras Biológicas , Rotura Cromosómica , Sitios Frágiles del Cromosoma/genética , Cromosomas Humanos Par 10/genética , Femenino , Salud , Neoplasias Hematológicas/genética , Neoplasias Hematológicas/mortalidad , Humanos , Masculino , Persona de Mediana Edad , Penetrancia , Reino Unido
20.
Prenat Diagn ; 44(4): 454-464, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38242839

RESUMEN

Advances in sequencing and imaging technologies enable enhanced assessment in the prenatal space, with a goal to diagnose and predict the natural history of disease, to direct targeted therapies, and to implement clinical management, including transfer of care, election of supportive care, and selection of surgical interventions. The current lack of standardization and aggregation stymies variant interpretation and gene discovery, which hinders the provision of prenatal precision medicine, leaving clinicians and patients without an accurate diagnosis. With large amounts of data generated, it is imperative to establish standards for data collection, processing, and aggregation. Aggregated and homogeneously processed genetic and phenotypic data permits dissection of the genomic architecture of prenatal presentations of disease and provides a dataset on which data analysis algorithms can be tuned to the prenatal space. Here we discuss the importance of generating aggregate data sets and how the prenatal space is driving the development of interoperable standards and phenotype-driven tools.


Asunto(s)
Medicina de Precisión , Diagnóstico Prenatal , Embarazo , Femenino , Humanos , Fenotipo , Genómica , Algoritmos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA