Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 80
Filtrar
1.
medRxiv ; 2024 Jan 23.
Artículo en Inglés | MEDLINE | ID: mdl-38343850

RESUMEN

Most genetic association studies focus on binary variants. To identify the effects of multi-allelic variation of tandem repeats (TRs) on human traits, we performed direct TR genotyping and phenome-wide association studies in 168,554 individuals from the UK Biobank, identifying 47 TRs showing causal associations with 73 traits. We replicated 23 of 31 (74%) of these causal associations in the All of Us cohort. While this set included several known repeat expansion disorders, novel associations we found were attributable to common polymorphic variation in TR length rather than rare expansions and include e.g. a coding polyhistidine motif in HRCT1 influencing risk of hypertension and a poly(CGC) in the 5'UTR of GNB2 influencing heart rate. Causal TRs were strongly enriched for associations with local gene expression and DNA methylation. Our study highlights the contribution of multi-allelic TRs to the "missing heritability" of the human genome.

2.
iScience ; 27(1): 108599, 2024 Jan 19.
Artículo en Inglés | MEDLINE | ID: mdl-38170020

RESUMEN

Valvular heart disease presents a significant health burden, yet advancements in valve biology and therapeutics have been hindered by the lack of accessibility to human valve cells. In this study, we have developed a scalable and feeder-free method to differentiate human induced pluripotent stem cells (iPSCs) into endocardial cells, which are transcriptionally and phenotypically distinct from vascular endothelial cells. These endocardial cells can be challenged to undergo endothelial-to-mesenchymal transition (EndMT), after which two distinct populations emerge-one population undergoes EndMT to become valvular interstitial cells (VICs), while the other population reinforces their endothelial identity to become valvular endothelial cells (VECs). We then characterized these populations through bulk RNA-seq transcriptome analyses and compared our VIC and VEC populations to pseudobulk data generated from normal valve tissue of a 15-week-old human fetus. By increasing the accessibility to these cell populations, we aim to accelerate discoveries for cardiac valve biology and disease.

3.
medRxiv ; 2023 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-37461547

RESUMEN

Repeat expansion disorders (REDs) are a devastating group of predominantly neurological diseases. Together they are common, affecting 1 in 3,000 people worldwide with population-specific differences. However, prevalence estimates of REDs are hampered by heterogeneous clinical presentation, variable geographic distributions, and technological limitations leading to under-ascertainment. Here, leveraging whole genome sequencing data from 82,176 individuals from different populations we found an overall carrier frequency of REDs of 1 in 340 individuals. Modelling disease prevalence using genetic data, age at onset and survival, we show that REDs are up to 3-fold more prevalent than currently reported figures. While some REDs are population-specific, e.g. Huntington's disease type 2, most REDs are represented in all broad genetic ancestries, including Africans and Asians, challenging the notion that some REDs are found only in European populations. These results have worldwide implications for local and global health communities in the diagnosis and management of REDs both at local and global levels.

4.
BMC Genomics ; 24(1): 226, 2023 May 01.
Artículo en Inglés | MEDLINE | ID: mdl-37127568

RESUMEN

Open reading frames (ORFs) with fewer than 100 codons are generally not annotated in genomes, although bona fide genes of that size are known. Newer biochemical studies have suggested that thousands of small protein-coding ORFs (smORFs) may exist in the human genome, but the true number and the biological significance of the micropeptides they encode remain uncertain. Here, we used a comparative genomics approach to identify high-confidence smORFs that are likely protein-coding. We identified 3,326 high-confidence smORFs using constraint within human populations and evolutionary conservation as additional lines of evidence. Next, we validated that, as a group, our high-confidence smORFs are conserved at the amino-acid level rather than merely residing in highly conserved non-coding regions. Finally, we found that high-confidence smORFs are enriched among disease-associated variants from GWAS. Overall, our results highlight that smORF-encoded peptides likely have important functional roles in human disease.


Asunto(s)
Péptidos , Proteínas , Humanos , Sistemas de Lectura Abierta , Proteínas/genética , Péptidos/genética , Genoma Humano , Micropéptidos
5.
medRxiv ; 2023 Dec 12.
Artículo en Inglés | MEDLINE | ID: mdl-37205357

RESUMEN

GC-rich tandem repeat expansions (TREs) are often associated with DNA methylation, gene silencing and folate-sensitive fragile sites and underlie several congenital and late-onset disorders. Through a combination of DNA methylation profiling and tandem repeat genotyping, we identified 24 methylated TREs and investigated their effects on human traits using PheWAS in 168,641 individuals from the UK Biobank, identifying 156 significant TRE:trait associations involving 17 different TREs. Of these, a GCC expansion in the promoter of AFF3 was linked with a 2.4-fold reduced probability of completing secondary education, an effect size comparable to several recurrent pathogenic microdeletions. In a cohort of 6,371 probands with neurodevelopmental problems of suspected genetic etiology, we observed a significant enrichment of AFF3 expansions compared to controls. With a population prevalence that is at least 5-fold higher than the TRE that causes fragile X syndrome, AFF3 expansions represent a significant cause of neurodevelopmental delay.

6.
Genome Res ; 33(2): 184-196, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36577521

RESUMEN

Short tandem repeats (STRs) contribute significantly to genetic diversity in humans, including disease-causing variation. Although the effect of STR variation on gene expression has been extensively assessed, their impact on epigenetics has been poorly studied and limited to specific genomic regions. Here, we investigated the hypothesis that some STRs act as independent regulators of local DNA methylation in the human genome and modify risk of common human traits. To address these questions, we first analyzed two independent data sets comprising PCR-free whole-genome sequencing (WGS) and genome-wide DNA methylation levels derived from whole-blood samples in 245 (discovery cohort) and 484 individuals (replication cohort). Using genotypes for 131,635 polymorphic STRs derived from WGS using HipSTR, we identified 11,870 STRs that associated with DNA methylation levels (mSTRs) of 11,774 CpGs (Bonferroni P < 0.001) in our discovery cohort, with 90% successfully replicating in our second cohort. Subsequently, through fine-mapping using CAVIAR we defined 585 of these mSTRs as the likely causal variants underlying the observed associations (fm-mSTRs) and linked a fraction of these to previously reported genome-wide association study signals, providing insights into the mechanisms underlying complex human traits. Furthermore, by integrating gene expression data, we observed that 12.5% of the tested fm-mSTRs also modulate expression levels of nearby genes, reinforcing their regulatory potential. Overall, our findings expand the catalog of functional sequence variants that affect genome regulation, highlighting the importance of incorporating STRs in future genetic association analysis and epigenetics data for the interpretation of trait-associated variants.


Asunto(s)
Metilación de ADN , Estudio de Asociación del Genoma Completo , Humanos , Repeticiones de Microsatélite , Genoma Humano , Genotipo
7.
Am J Hum Genet ; 109(6): 1065-1076, 2022 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-35609568

RESUMEN

The human genome contains tens of thousands of large tandem repeats and hundreds of genes that show common and highly variable copy-number changes. Due to their large size and repetitive nature, these variable number tandem repeats (VNTRs) and multicopy genes are generally recalcitrant to standard genotyping approaches and, as a result, this class of variation is poorly characterized. However, several recent studies have demonstrated that copy-number variation of VNTRs can modify local gene expression, epigenetics, and human traits, indicating that many have a functional role. Here, using read depth from whole-genome sequencing to profile copy number, we report results of a phenome-wide association study (PheWAS) of VNTRs and multicopy genes in a discovery cohort of ∼35,000 samples, identifying 32 traits associated with copy number of 38 VNTRs and multicopy genes at 1% FDR. We replicated many of these signals in an independent cohort and observed that VNTRs showing trait associations were significantly enriched for expression QTLs with nearby genes, providing strong support for our results. Fine-mapping studies indicated that in the majority (∼90%) of cases, the VNTRs and multicopy genes we identified represent the causal variants underlying the observed associations. Furthermore, several lie in regions where prior SNV-based GWASs have failed to identify any significant associations with these traits. Our study indicates that copy number of VNTRs and multicopy genes contributes to diverse human traits and suggests that complex structural variants potentially explain some of the so-called "missing heritability" of SNV-based GWASs.


Asunto(s)
Variaciones en el Número de Copia de ADN , Repeticiones de Minisatélite , Variaciones en el Número de Copia de ADN/genética , Genoma Humano , Estudio de Asociación del Genoma Completo , Humanos , Repeticiones de Minisatélite/genética , Fenotipo
8.
JAMA Psychiatry ; 79(3): 250-259, 2022 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-35080590

RESUMEN

IMPORTANCE: Past studies identified rare copy number variants (CNVs) as risk factors for neurodevelopmental disorders (NDDs), including autism spectrum disorder and schizophrenia. However, the clinical characterization of NDD CNVs is understudied in population cohorts unselected for neuropsychiatric disorders and in cohorts of diverse ancestry. OBJECTIVE: To identify individuals harboring NDD CNVs in a multiancestry biobank and to query their enrichment for select neuropsychiatric disorders as well as association with multiple medical disorders. DESIGN, SETTINGS, AND PARTICIPANTS: In a series of phenotypic enrichment and association analyses, NDD CNVs were clinically characterized among 24 877 participants in the BioMe biobank, an electronic health record-linked biobank derived from the Mount Sinai Health System, New York, New York. Participants were recruited into the biobank since September 2007 across diverse ancestry and medical and neuropsychiatric specialties. For the current analyses, electronic health record data were analyzed from May 2004 through May 2019. MAIN OUTCOMES AND MEASURES: NDD CNVs were identified using a consensus of 2 CNV calling algorithms, based on whole-exome sequencing and genotype array data, followed by novel in-silico clinical assessments. RESULTS: Of 24 877 participants, 14 586 (58.7%) were female; self-reported ancestry categories included 5965 (24.0%) who were of African ancestry, 7892 (31.7%) who were of European ancestry, and 8536 (34.3%) who were of Hispanic ancestry; and the mean (SD) age was 50.5 (17.3) years. Among 24 877 individuals, the prevalence of 64 NDD CNVs was 2.5% (n = 627), with prevalence varying by locus, corroborating the presence of some relatively highly prevalent NDD CNVs (eg, 15q11.2 deletion/duplication). An aggregate set of NDD CNVs were enriched for congenital disorders (odds ratio, 2.0; 95% CI, 1.1-3.5; P = .01) and major depressive disorder (odds ratio, 1.5; 95% CI, 1.1-2.0; P = .01). In a meta-analysis of medical diagnoses (n = 195 hierarchically clustered diagnostic codes), NDD CNVs were significantly associated with several medical outcomes, including essential hypertension (z score = 3.6; P = 2.8 × 10-4), kidney failure (z score = 3.3; P = 1.1 × 10-3), and obstructive sleep apnea (z score = 3.4; P = 8.1 × 10-4) and, in another analysis, morbid obesity (z score = 3.8; P = 1.3 × 10-4). Further, NDD CNVs were associated with increased body mass index in a multiancestry analysis (ß = 0.19; 95% CI, 0.10-0.31; P = .003). For 36 common serum tests, there was no association with NDD CNVs. CONCLUSIONS AND RELEVANCE: Clinical features of individuals harboring NDD CNVs were elucidated in a large-scale, multiancestry biobank, identifying enrichments for congenital disorders and major depressive disorder as well as associations with several medical outcomes, including hypertension, kidney failure, and obesity and obesity-related phenotypes, specifically obstructive sleep apnea and increased body mass index. The association between NDD CNVs and obesity outcomes indicate further potential pleiotropy of NDD CNVs beyond neurodevelopmental outcomes previously reported. Future clinical genetic investigations may lead to insights of at-risk individuals and therapeutic strategies targeting specific genetic variants. The importance of diverse inclusion within biobanks and considering the effect of rare genetic variants in a multiancestry context is evident.


Asunto(s)
Trastorno del Espectro Autista , Trastorno Depresivo Mayor , Trastornos del Neurodesarrollo , Insuficiencia Renal , Apnea Obstructiva del Sueño , Trastorno del Espectro Autista/diagnóstico , Trastorno del Espectro Autista/epidemiología , Trastorno del Espectro Autista/genética , Bancos de Muestras Biológicas , Variaciones en el Número de Copia de ADN/genética , Femenino , Humanos , Masculino , Trastornos del Neurodesarrollo/diagnóstico , Trastornos del Neurodesarrollo/epidemiología , Trastornos del Neurodesarrollo/genética , Obesidad
9.
PLoS One ; 16(12): e0261374, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34898642

RESUMEN

Lymphoblastoid cell lines (LCLs) have been critical to establishing genetic resources for biomedical science. They have been used extensively to study human genetic diversity, genome function, and inform the development of tools and methodologies for augmenting disease genetics research. While the validity of variant callsets from LCLs has been demonstrated for most of the genome, previous work has shown that DNA extracted from LCLs is modified by V(D)J recombination within the immunoglobulin (IG) loci, regions that harbor antibody genes critical to immune system function. However, the impacts of V(D)J on short read sequencing data generated from LCLs has not been extensively investigated. In this study, we used LCL-derived short read sequencing data from the 1000 Genomes Project (n = 2,504) to identify signatures of V(D)J recombination. Our analyses revealed sample-level impacts of V(D)J recombination that varied depending on the degree of inferred monoclonality. We showed that V(D)J associated somatic deletions impacted genotyping accuracy, leading to adulterated population-level estimates of allele frequency and linkage disequilibrium. These findings illuminate limitations of using LCLs and short read data for building genetic resources in the IG loci, with implications for interpreting previous disease association studies in these regions.


Asunto(s)
Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/metabolismo , Recombinación V(D)J/genética , Alelos , Linfocitos B/inmunología , Línea Celular Tumoral/inmunología , Línea Celular Tumoral/metabolismo , Bases de Datos Genéticas , Frecuencia de los Genes/genética , Genes de Inmunoglobulinas/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Región Variable de Inmunoglobulina/genética , Inmunoglobulinas/genética , Inmunoglobulinas/inmunología , Análisis de Secuencia de ADN/métodos , Recombinación V(D)J/inmunología
10.
Nat Commun ; 12(1): 3517, 2021 06 10.
Artículo en Inglés | MEDLINE | ID: mdl-34112773

RESUMEN

Epigenome-wide association studies of Alzheimer's disease have highlighted neuropathology-associated DNA methylation differences, although existing studies have been limited in sample size and utilized different brain regions. Here, we combine data from six DNA methylomic studies of Alzheimer's disease (N = 1453 unique individuals) to identify differential methylation associated with Braak stage in different brain regions and across cortex. We identify 236 CpGs in the prefrontal cortex, 95 CpGs in the temporal gyrus and ten CpGs in the entorhinal cortex at Bonferroni significance, with none in the cerebellum. Our cross-cortex meta-analysis (N = 1408 donors) identifies 220 CpGs associated with neuropathology, annotated to 121 genes, of which 84 genes have not been previously reported at this significance threshold. We have replicated our findings using two further DNA methylomic datasets consisting of a further >600 unique donors. The meta-analysis summary statistics are available in our online data resource ( www.epigenomicslab.com/ad-meta-analysis/ ).


Asunto(s)
Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/metabolismo , Metilación de ADN , Corteza Entorrinal/metabolismo , Epigenoma , Corteza Prefrontal/metabolismo , Lóbulo Temporal/metabolismo , Anciano , Anciano de 80 o más Años , Enfermedad de Alzheimer/patología , Estudios de Cohortes , Islas de CpG , Corteza Entorrinal/patología , Epigénesis Genética , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Persona de Mediana Edad , Corteza Prefrontal/patología , Curva ROC , Lóbulo Temporal/patología
11.
Am J Hum Genet ; 108(5): 809-824, 2021 05 06.
Artículo en Inglés | MEDLINE | ID: mdl-33794196

RESUMEN

Variable number tandem repeats (VNTRs) are composed of large tandemly repeated motifs, many of which are highly polymorphic in copy number. However, because of their large size and repetitive nature, they remain poorly studied. To investigate the regulatory potential of VNTRs, we used read-depth data from Illumina whole-genome sequencing to perform association analysis between copy number of ∼70,000 VNTRs (motif size ≥ 10 bp) with both gene expression (404 samples in 48 tissues) and DNA methylation (235 samples in peripheral blood), identifying thousands of VNTRs that are associated with local gene expression (eVNTRs) and DNA methylation levels (mVNTRs). Using an independent cohort, we validated 73%-80% of signals observed in the two discovery cohorts, while allelic analysis of VNTR length and CpG methylation in 30 Oxford Nanopore genomes gave additional support for mVNTR loci, thus providing robust evidence to support that these represent genuine associations. Further, conditional analysis indicated that many eVNTRs and mVNTRs act as QTLs independently of other local variation. We also observed strong enrichments of eVNTRs and mVNTRs for regulatory features such as enhancers and promoters. Using the Human Genome Diversity Panel, we define sets of VNTRs that show highly divergent copy numbers among human populations and show that these are enriched for regulatory effects and preferentially associate with genes that have been linked with human phenotypes through GWASs. Our study provides strong evidence supporting functional variation at thousands of VNTRs and defines candidate sets of VNTRs, copy number variation of which potentially plays a role in numerous human phenotypes.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Metilación de ADN , Regulación de la Expresión Génica , Repeticiones de Minisatélite/genética , Sitios de Carácter Cuantitativo/genética , Adolescente , Adulto , Algoritmos , Niño , Preescolar , Cromosomas Humanos X/genética , Estudios de Cohortes , Islas de CpG/genética , Elementos de Facilitación Genéticos/genética , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , Fenotipo , Regiones Promotoras Genéticas/genética , Adulto Joven
12.
PLoS Genet ; 16(11): e1009189, 2020 11.
Artículo en Inglés | MEDLINE | ID: mdl-33216750

RESUMEN

Although DNA methylation is the best characterized epigenetic mark, the mechanism by which it is targeted to specific regions in the genome remains unclear. Recent studies have revealed that local DNA methylation profiles might be dictated by cis-regulatory DNA sequences that mainly operate via DNA-binding factors. Consistent with this finding, we have recently shown that disruption of CTCF-binding sites by rare single nucleotide variants (SNVs) can underlie cis-linked DNA methylation changes in patients with congenital anomalies. These data raise the hypothesis that rare genetic variation at transcription factor binding sites (TFBSs) might contribute to local DNA methylation patterning. In this work, by combining blood genome-wide DNA methylation profiles, whole genome sequencing-derived SNVs from 247 unrelated individuals along with 133 predicted TFBS motifs derived from ENCODE ChIP-Seq data, we observed an association between the disruption of binding sites for multiple TFs by rare SNVs and extreme DNA methylation values at both local and, to a lesser extent, distant CpGs. While the majority of these changes affected only single CpGs, 24% were associated with multiple outlier CpGs within ±1kb of the disrupted TFBS. Interestingly, disruption of functionally constrained sites within TF motifs lead to larger DNA methylation changes at nearby CpG sites. Altogether, these findings suggest that rare SNVs at TFBS negatively influence TF-DNA binding, which can lead to an altered local DNA methylation profile. Furthermore, subsequent integration of DNA methylation and RNA-Seq profiles from cardiac tissues enabled us to observe an association between rare SNV-directed DNA methylation and outlier expression of nearby genes. In conclusion, our findings not only provide insights into the effect of rare genetic variation at TFBS on shaping local DNA methylation and its consequences on genome regulation, but also provide a rationale to incorporate DNA methylation data to interpret the functional role of rare variants.


Asunto(s)
Islas de CpG/genética , Metilación de ADN , Epigénesis Genética , Genoma Humano/genética , Factores de Transcripción/metabolismo , Adolescente , Adulto , Sitios de Unión/genética , Niño , Preescolar , Secuenciación de Inmunoprecipitación de Cromatina , Estudios de Cohortes , Femenino , Cardiopatías Congénitas/sangre , Cardiopatías Congénitas/genética , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma , Adulto Joven
13.
Front Immunol ; 11: 2136, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-33072076

RESUMEN

An incomplete ascertainment of genetic variation within the highly polymorphic immunoglobulin heavy chain locus (IGH) has hindered our ability to define genetic factors that influence antibody-mediated processes. Due to locus complexity, standard high-throughput approaches have failed to accurately and comprehensively capture IGH polymorphism. As a result, the locus has only been fully characterized two times, severely limiting our knowledge of human IGH diversity. Here, we combine targeted long-read sequencing with a novel bioinformatics tool, IGenotyper, to fully characterize IGH variation in a haplotype-specific manner. We apply this approach to eight human samples, including a haploid cell line and two mother-father-child trios, and demonstrate the ability to generate high-quality assemblies (>98% complete and >99% accurate), genotypes, and gene annotations, identifying 2 novel structural variants and 15 novel IGH alleles. We show multiplexing allows for scaling of the approach without impacting data quality, and that our genotype call sets are more accurate than short-read (>35% increase in true positives and >97% decrease in false-positives) and array/imputation-based datasets. This framework establishes a desperately needed foundation for leveraging IG genomic data to study population-level variation in antibody-mediated immunity, critical for bettering our understanding of disease risk, and responses to vaccines and therapeutics.


Asunto(s)
Biología Computacional/métodos , Genes de Inmunoglobulinas , Variación Genética , Técnicas de Genotipaje , Haplotipos/genética , Cadenas Pesadas de Inmunoglobulina/genética , Polimorfismo Genético , Línea Celular , Presentación de Datos , Conjuntos de Datos como Asunto , Familia , Biblioteca de Genes , Variación Estructural del Genoma , Humanos , Anotación de Secuencia Molecular , Alineación de Secuencia , Análisis de Secuencia de ADN , Homología de Secuencia de Ácido Nucleico , Interfaz Usuario-Computador , Flujo de Trabajo
14.
Am J Hum Genet ; 107(4): 654-669, 2020 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-32937144

RESUMEN

There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.


Asunto(s)
Proteína BRCA1/genética , Epigénesis Genética , Enfermedades Genéticas Congénitas/genética , Genoma Humano , Receptores de LDL/genética , Expansión de Repetición de Trinucleótido , Proteína BRCA1/metabolismo , Metilación de ADN , Femenino , Ácido Fólico/metabolismo , Silenciador del Gen , Enfermedades Genéticas Congénitas/diagnóstico , Enfermedades Genéticas Congénitas/patología , Sitios Genéticos , Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Regiones Promotoras Genéticas , Receptores de LDL/metabolismo , Gemelos Monocigóticos
15.
Am J Hum Genet ; 107(3): 555-563, 2020 09 03.
Artículo en Inglés | MEDLINE | ID: mdl-32758449

RESUMEN

Helsmoortel-Van der Aa syndrome (HVDAS) is a neurodevelopmental condition associated with intellectual disability/developmental delay, autism spectrum disorder, and multiple medical comorbidities. HVDAS is caused by mutations in activity-dependent neuroprotective protein (ADNP). A recent study identified genome-wide DNA methylation changes in 22 individuals with HVDAS, adding to the group of neurodevelopmental disorders with an epigenetic signature. This methylation signature segregated those with HVDAS into two groups based on the location of the mutations. Here, we conducted an independent study on 24 individuals with HVDAS and replicated the existence of the two mutation-dependent episignatures. To probe whether the two distinct episignatures correlate with clinical outcomes, we used deep behavioral and neurobiological data from two prospective cohorts of individuals with a genetic diagnosis of HVDAS. We found limited phenotypic differences between the two HVDAS-affected groups and no evidence that individuals with more widespread methylation changes are more severely affected. Moreover, in spite of the methylation changes, we observed no profound alterations in the blood transcriptome of individuals with HVDAS. Our data warrant caution in harnessing methylation signatures in HVDAS as a tool for clinical stratification, at least with regard to behavioral phenotypes.


Asunto(s)
Trastorno del Espectro Autista/genética , Proteínas de Homeodominio/genética , Discapacidad Intelectual/genética , Proteínas del Tejido Nervioso/genética , Trastornos del Neurodesarrollo/genética , Trastorno del Espectro Autista/patología , Niño , Metilación de ADN/genética , Discapacidades del Desarrollo/genética , Discapacidades del Desarrollo/patología , Epigénesis Genética/genética , Femenino , Humanos , Discapacidad Intelectual/patología , Masculino , Mutación/genética , Trastornos del Neurodesarrollo/patología , Fenotipo , Transcriptoma/genética
16.
Hum Mutat ; 41(4): 800-806, 2020 04.
Artículo en Inglés | MEDLINE | ID: mdl-31898844

RESUMEN

The mechanisms underlying de novo insertion/deletion (indel) genesis, such as polymerase slippage, have been hypothesized but not well characterized in the human genome. We implemented two methodological improvements, which were leveraged to dissect indel mutagenesis. We assigned de novo variants to parent-of-origin (i.e., phasing) with low-coverage long-read whole-genome sequencing, achieving better phasing compared to short-read sequencing (medians of 84% and 23%, respectively). We then wrote an application programming interface to classify indels into three subtypes according to sequence context. Across three cohorts with different phasing methods (Ntrios = 540, all cohorts), we observed that one de novo indel subtype, change in copy count (CCC), was significantly correlated with father's (p = 7.1 × 10-4 ) but not mother's (p = .45) age at conception. We replicated this effect in three cohorts without de novo phasing (ppaternal = 1.9 × 10-9 , pmaternal = .61; Ntrios = 3,391, all cohorts). Although this is consistent with polymerase slippage during spermatogenesis, the percentage of variance explained by paternal age was low, and we did not observe an association with replication timing. These results suggest that spermatogenesis-specific events have a minor role in CCC indel mutagenesis, one not observed for other indel subtypes nor for maternal age in general. These results have implications for indel modeling in evolution and disease.


Asunto(s)
Biología Computacional/métodos , Genoma Humano , Genómica/métodos , Mutación INDEL , Programas Informáticos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Polimorfismo de Nucleótido Simple
17.
Bioinformatics ; 36(3): 922-924, 2020 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-31397844

RESUMEN

SUMMARY: While next-generation sequencing (NGS) has dramatically increased the availability of genomic data, phased genome assembly and structural variant (SV) analyses are limited by NGS read lengths. Long-read sequencing from Pacific Biosciences and NGS barcoding from 10x Genomics hold the potential for far more comprehensive views of individual genomes. Here, we present MsPAC, a tool that combines both technologies to partition reads, assemble haplotypes (via existing software) and convert assemblies into high-quality, phased SV predictions. MsPAC represents a framework for haplotype-resolved SV calls that moves one step closer to fully resolved, diploid genomes. AVAILABILITY AND IMPLEMENTATION: https://github.com/oscarlr/MsPAC. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Genoma , Haplotipos , Análisis de Secuencia de ADN , Programas Informáticos
18.
J Allergy Clin Immunol ; 145(4): 1219-1230, 2020 04.
Artículo en Inglés | MEDLINE | ID: mdl-31838046

RESUMEN

BACKGROUND: Unexpected allergic reactions to peanut are the most common cause of fatal food-related anaphylaxis. Mechanisms underlying the variable severity of peanut-allergic reactions remain unclear. OBJECTIVES: We sought to expand mechanistic understanding of reaction severity in peanut allergy. METHODS: We performed an integrated transcriptomic and epigenomic study of peanut-allergic children as they reacted in vivo during double-blind, placebo-controlled peanut challenges. We integrated whole-blood transcriptome and CD4+ T-cell epigenome profiles to identify molecular signatures of reaction severity (ie, how severely a peanut-allergic child reacts when exposed to peanut). A threshold-weighted reaction severity score was calculated for each subject based on symptoms experienced during peanut challenge and the eliciting dose. Through linear mixed effects modeling, network construction, and causal mediation analysis, we identified genes, CpGs, and their interactions that mediate reaction severity. Findings were replicated in an independent cohort. RESULTS: We identified 318 genes with changes in expression during the course of reaction associated with reaction severity, and 203 CpG sites with differential DNA methylation associated with reaction severity. After replicating these findings in an independent cohort, we constructed interaction networks with the identified peanut severity genes and CpGs. These analyses and leukocyte deconvolution highlighted neutrophil-mediated immunity. We identified NFKBIA and ARG1 as hubs in the networks and 3 groups of interacting key node CpGs and peanut severity genes encompassing immune response, chemotaxis, and regulation of macroautophagy. In addition, we found that gene expression of PHACTR1 and ZNF121 causally mediates the association between methylation at corresponding CpGs and reaction severity, suggesting that methylation may serve as an anchor upon which gene expression modulates reaction severity. CONCLUSIONS: Our findings enhance current mechanistic understanding of the genetic and epigenetic architecture of reaction severity in peanut allergy.


Asunto(s)
Anafilaxia/genética , Linfocitos T CD4-Positivos/fisiología , Hipersensibilidad al Cacahuete/genética , Adolescente , Alérgenos/inmunología , Arachis/inmunología , Niño , Estudios de Cohortes , Metilación de ADN , Progresión de la Enfermedad , Epigénesis Genética , Femenino , Redes Reguladoras de Genes , Humanos , Inmunidad/genética , Inmunización , Masculino , Transcriptoma
19.
BMC Biol ; 17(1): 50, 2019 06 24.
Artículo en Inglés | MEDLINE | ID: mdl-31234833

RESUMEN

BACKGROUND: Identification of imprinted genes, demonstrating a consistent preference towards the paternal or maternal allelic expression, is important for the understanding of gene expression regulation during embryonic development and of the molecular basis of developmental disorders with a parent-of-origin effect. Combining allelic analysis of RNA-Seq data with phased genotypes in family trios provides a powerful method to detect parent-of-origin biases in gene expression. RESULTS: We report findings in 296 family trios from two large studies: 165 lymphoblastoid cell lines from the 1000 Genomes Project and 131 blood samples from the Genome of the Netherlands (GoNL) participants. Based on parental haplotypes, we identified > 2.8 million transcribed heterozygous SNVs phased for parental origin and developed a robust statistical framework for measuring allelic expression. We identified a total of 45 imprinted genes and one imprinted unannotated transcript, including multiple imprinted transcripts showing incomplete parental expression bias that was located adjacent to strongly imprinted genes. For example, PXDC1, a gene which lies adjacent to the paternally expressed gene FAM50B, shows a 2:1 paternal expression bias. Other imprinted genes had promoter regions that coincide with sites of parentally biased DNA methylation identified in the blood from uniparental disomy (UPD) samples, thus providing independent validation of our results. Using the stranded nature of the RNA-Seq data in lymphoblastoid cell lines, we identified multiple loci with overlapping sense/antisense transcripts, of which one is expressed paternally and the other maternally. Using a sliding window approach, we searched for imprinted expression across the entire genome, identifying a novel imprinted putative lncRNA in 13q21.2. Overall, we identified 7 transcripts showing parental bias in gene expression which were not reported in 4 other recent RNA-Seq studies of imprinting. CONCLUSIONS: Our methods and data provide a robust and high-resolution map of imprinted gene expression in the human genome.


Asunto(s)
Alelos , Expresión Génica/genética , Impresión Genómica/genética , Haplotipos/genética , Análisis Químico de la Sangre , Línea Celular , Humanos , Análisis de Secuencia de ARN
20.
Hum Mutat ; 40(7): 952-961, 2019 07.
Artículo en Inglés | MEDLINE | ID: mdl-30900359

RESUMEN

While many studies have led to the identification of rare sequence variants linked with susceptibility to autism and schizophrenia, the contribution of rare epigenetic variations (epivariations) in these disorders remains largely unexplored. Previously we presented evidence that epivariations occur relatively frequently in the human genome, and likely contribute to a subset of congenital and neurodevelopmental disorders through the disruption of dosage-sensitive genes. Here we extend this approach, studying methylation profiles from 297 samples with autism and 767 cases with schizophrenia, identifying 84 and 268 rare epivariations in these two cohorts, respectively, that were absent from 4,860 population controls. We observed multiple features associated with these epivariations that support their pathogenic relevance, including (a) a significant enrichment for epivariations in schizophrenic individuals at genes previously linked with schizophrenia, (b) increased brain expression of genes associated with epivariations found in autism cases compared with controls, (c) in autism families, a significant excess of epivariations found specifically in affected versus unaffected sibs, (d) Gene Ontology terms linked with epivariations found in autism, including "D1 dopamine receptor binding." Our study provides additional evidence that rare epivariations likely contribute to the mutational spectra underlying neurodevelopmental disorders.


Asunto(s)
Trastorno Autístico/genética , Metilación de ADN , Esquizofrenia/genética , Estudios de Casos y Controles , Epigénesis Genética , Femenino , Ontología de Genes , Predisposición Genética a la Enfermedad , Humanos , Masculino
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...