Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Genome Res ; 33(2): 184-196, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36577521

RESUMEN

Short tandem repeats (STRs) contribute significantly to genetic diversity in humans, including disease-causing variation. Although the effect of STR variation on gene expression has been extensively assessed, their impact on epigenetics has been poorly studied and limited to specific genomic regions. Here, we investigated the hypothesis that some STRs act as independent regulators of local DNA methylation in the human genome and modify risk of common human traits. To address these questions, we first analyzed two independent data sets comprising PCR-free whole-genome sequencing (WGS) and genome-wide DNA methylation levels derived from whole-blood samples in 245 (discovery cohort) and 484 individuals (replication cohort). Using genotypes for 131,635 polymorphic STRs derived from WGS using HipSTR, we identified 11,870 STRs that associated with DNA methylation levels (mSTRs) of 11,774 CpGs (Bonferroni P < 0.001) in our discovery cohort, with 90% successfully replicating in our second cohort. Subsequently, through fine-mapping using CAVIAR we defined 585 of these mSTRs as the likely causal variants underlying the observed associations (fm-mSTRs) and linked a fraction of these to previously reported genome-wide association study signals, providing insights into the mechanisms underlying complex human traits. Furthermore, by integrating gene expression data, we observed that 12.5% of the tested fm-mSTRs also modulate expression levels of nearby genes, reinforcing their regulatory potential. Overall, our findings expand the catalog of functional sequence variants that affect genome regulation, highlighting the importance of incorporating STRs in future genetic association analysis and epigenetics data for the interpretation of trait-associated variants.


Asunto(s)
Metilación de ADN , Estudio de Asociación del Genoma Completo , Humanos , Repeticiones de Microsatélite , Genoma Humano , Genotipo
2.
Am J Hum Genet ; 109(6): 1065-1076, 2022 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-35609568

RESUMEN

The human genome contains tens of thousands of large tandem repeats and hundreds of genes that show common and highly variable copy-number changes. Due to their large size and repetitive nature, these variable number tandem repeats (VNTRs) and multicopy genes are generally recalcitrant to standard genotyping approaches and, as a result, this class of variation is poorly characterized. However, several recent studies have demonstrated that copy-number variation of VNTRs can modify local gene expression, epigenetics, and human traits, indicating that many have a functional role. Here, using read depth from whole-genome sequencing to profile copy number, we report results of a phenome-wide association study (PheWAS) of VNTRs and multicopy genes in a discovery cohort of ∼35,000 samples, identifying 32 traits associated with copy number of 38 VNTRs and multicopy genes at 1% FDR. We replicated many of these signals in an independent cohort and observed that VNTRs showing trait associations were significantly enriched for expression QTLs with nearby genes, providing strong support for our results. Fine-mapping studies indicated that in the majority (∼90%) of cases, the VNTRs and multicopy genes we identified represent the causal variants underlying the observed associations. Furthermore, several lie in regions where prior SNV-based GWASs have failed to identify any significant associations with these traits. Our study indicates that copy number of VNTRs and multicopy genes contributes to diverse human traits and suggests that complex structural variants potentially explain some of the so-called "missing heritability" of SNV-based GWASs.


Asunto(s)
Variaciones en el Número de Copia de ADN , Repeticiones de Minisatélite , Variaciones en el Número de Copia de ADN/genética , Genoma Humano , Estudio de Asociación del Genoma Completo , Humanos , Repeticiones de Minisatélite/genética , Fenotipo
3.
Am J Hum Genet ; 108(5): 809-824, 2021 05 06.
Artículo en Inglés | MEDLINE | ID: mdl-33794196

RESUMEN

Variable number tandem repeats (VNTRs) are composed of large tandemly repeated motifs, many of which are highly polymorphic in copy number. However, because of their large size and repetitive nature, they remain poorly studied. To investigate the regulatory potential of VNTRs, we used read-depth data from Illumina whole-genome sequencing to perform association analysis between copy number of ∼70,000 VNTRs (motif size ≥ 10 bp) with both gene expression (404 samples in 48 tissues) and DNA methylation (235 samples in peripheral blood), identifying thousands of VNTRs that are associated with local gene expression (eVNTRs) and DNA methylation levels (mVNTRs). Using an independent cohort, we validated 73%-80% of signals observed in the two discovery cohorts, while allelic analysis of VNTR length and CpG methylation in 30 Oxford Nanopore genomes gave additional support for mVNTR loci, thus providing robust evidence to support that these represent genuine associations. Further, conditional analysis indicated that many eVNTRs and mVNTRs act as QTLs independently of other local variation. We also observed strong enrichments of eVNTRs and mVNTRs for regulatory features such as enhancers and promoters. Using the Human Genome Diversity Panel, we define sets of VNTRs that show highly divergent copy numbers among human populations and show that these are enriched for regulatory effects and preferentially associate with genes that have been linked with human phenotypes through GWASs. Our study provides strong evidence supporting functional variation at thousands of VNTRs and defines candidate sets of VNTRs, copy number variation of which potentially plays a role in numerous human phenotypes.


Asunto(s)
Variaciones en el Número de Copia de ADN/genética , Metilación de ADN , Regulación de la Expresión Génica , Repeticiones de Minisatélite/genética , Sitios de Carácter Cuantitativo/genética , Adolescente , Adulto , Algoritmos , Niño , Preescolar , Cromosomas Humanos X/genética , Estudios de Cohortes , Islas de CpG/genética , Elementos de Facilitación Genéticos/genética , Femenino , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , Fenotipo , Regiones Promotoras Genéticas/genética , Adulto Joven
4.
Am J Hum Genet ; 107(4): 654-669, 2020 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-32937144

RESUMEN

There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.


Asunto(s)
Proteína BRCA1/genética , Epigénesis Genética , Enfermedades Genéticas Congénitas/genética , Genoma Humano , Receptores de LDL/genética , Expansión de Repetición de Trinucleótido , Proteína BRCA1/metabolismo , Metilación de ADN , Femenino , Ácido Fólico/metabolismo , Silenciador del Gen , Enfermedades Genéticas Congénitas/diagnóstico , Enfermedades Genéticas Congénitas/patología , Sitios Genéticos , Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Masculino , Regiones Promotoras Genéticas , Receptores de LDL/metabolismo , Gemelos Monocigóticos
5.
PLoS Genet ; 16(11): e1009189, 2020 11.
Artículo en Inglés | MEDLINE | ID: mdl-33216750

RESUMEN

Although DNA methylation is the best characterized epigenetic mark, the mechanism by which it is targeted to specific regions in the genome remains unclear. Recent studies have revealed that local DNA methylation profiles might be dictated by cis-regulatory DNA sequences that mainly operate via DNA-binding factors. Consistent with this finding, we have recently shown that disruption of CTCF-binding sites by rare single nucleotide variants (SNVs) can underlie cis-linked DNA methylation changes in patients with congenital anomalies. These data raise the hypothesis that rare genetic variation at transcription factor binding sites (TFBSs) might contribute to local DNA methylation patterning. In this work, by combining blood genome-wide DNA methylation profiles, whole genome sequencing-derived SNVs from 247 unrelated individuals along with 133 predicted TFBS motifs derived from ENCODE ChIP-Seq data, we observed an association between the disruption of binding sites for multiple TFs by rare SNVs and extreme DNA methylation values at both local and, to a lesser extent, distant CpGs. While the majority of these changes affected only single CpGs, 24% were associated with multiple outlier CpGs within ±1kb of the disrupted TFBS. Interestingly, disruption of functionally constrained sites within TF motifs lead to larger DNA methylation changes at nearby CpG sites. Altogether, these findings suggest that rare SNVs at TFBS negatively influence TF-DNA binding, which can lead to an altered local DNA methylation profile. Furthermore, subsequent integration of DNA methylation and RNA-Seq profiles from cardiac tissues enabled us to observe an association between rare SNV-directed DNA methylation and outlier expression of nearby genes. In conclusion, our findings not only provide insights into the effect of rare genetic variation at TFBS on shaping local DNA methylation and its consequences on genome regulation, but also provide a rationale to incorporate DNA methylation data to interpret the functional role of rare variants.


Asunto(s)
Islas de CpG/genética , Metilación de ADN , Epigénesis Genética , Genoma Humano/genética , Factores de Transcripción/metabolismo , Adolescente , Adulto , Sitios de Unión/genética , Niño , Preescolar , Secuenciación de Inmunoprecipitación de Cromatina , Estudios de Cohortes , Femenino , Cardiopatías Congénitas/sangre , Cardiopatías Congénitas/genética , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma , Adulto Joven
6.
BMC Biol ; 17(1): 50, 2019 06 24.
Artículo en Inglés | MEDLINE | ID: mdl-31234833

RESUMEN

BACKGROUND: Identification of imprinted genes, demonstrating a consistent preference towards the paternal or maternal allelic expression, is important for the understanding of gene expression regulation during embryonic development and of the molecular basis of developmental disorders with a parent-of-origin effect. Combining allelic analysis of RNA-Seq data with phased genotypes in family trios provides a powerful method to detect parent-of-origin biases in gene expression. RESULTS: We report findings in 296 family trios from two large studies: 165 lymphoblastoid cell lines from the 1000 Genomes Project and 131 blood samples from the Genome of the Netherlands (GoNL) participants. Based on parental haplotypes, we identified > 2.8 million transcribed heterozygous SNVs phased for parental origin and developed a robust statistical framework for measuring allelic expression. We identified a total of 45 imprinted genes and one imprinted unannotated transcript, including multiple imprinted transcripts showing incomplete parental expression bias that was located adjacent to strongly imprinted genes. For example, PXDC1, a gene which lies adjacent to the paternally expressed gene FAM50B, shows a 2:1 paternal expression bias. Other imprinted genes had promoter regions that coincide with sites of parentally biased DNA methylation identified in the blood from uniparental disomy (UPD) samples, thus providing independent validation of our results. Using the stranded nature of the RNA-Seq data in lymphoblastoid cell lines, we identified multiple loci with overlapping sense/antisense transcripts, of which one is expressed paternally and the other maternally. Using a sliding window approach, we searched for imprinted expression across the entire genome, identifying a novel imprinted putative lncRNA in 13q21.2. Overall, we identified 7 transcripts showing parental bias in gene expression which were not reported in 4 other recent RNA-Seq studies of imprinting. CONCLUSIONS: Our methods and data provide a robust and high-resolution map of imprinted gene expression in the human genome.


Asunto(s)
Alelos , Expresión Génica/genética , Impresión Genómica/genética , Haplotipos/genética , Análisis Químico de la Sangre , Línea Celular , Humanos , Análisis de Secuencia de ARN
7.
iScience ; 27(1): 108599, 2024 Jan 19.
Artículo en Inglés | MEDLINE | ID: mdl-38170020

RESUMEN

Valvular heart disease presents a significant health burden, yet advancements in valve biology and therapeutics have been hindered by the lack of accessibility to human valve cells. In this study, we have developed a scalable and feeder-free method to differentiate human induced pluripotent stem cells (iPSCs) into endocardial cells, which are transcriptionally and phenotypically distinct from vascular endothelial cells. These endocardial cells can be challenged to undergo endothelial-to-mesenchymal transition (EndMT), after which two distinct populations emerge-one population undergoes EndMT to become valvular interstitial cells (VICs), while the other population reinforces their endothelial identity to become valvular endothelial cells (VECs). We then characterized these populations through bulk RNA-seq transcriptome analyses and compared our VIC and VEC populations to pseudobulk data generated from normal valve tissue of a 15-week-old human fetus. By increasing the accessibility to these cell populations, we aim to accelerate discoveries for cardiac valve biology and disease.

8.
medRxiv ; 2024 Jan 23.
Artículo en Inglés | MEDLINE | ID: mdl-38343850

RESUMEN

Most genetic association studies focus on binary variants. To identify the effects of multi-allelic variation of tandem repeats (TRs) on human traits, we performed direct TR genotyping and phenome-wide association studies in 168,554 individuals from the UK Biobank, identifying 47 TRs showing causal associations with 73 traits. We replicated 23 of 31 (74%) of these causal associations in the All of Us cohort. While this set included several known repeat expansion disorders, novel associations we found were attributable to common polymorphic variation in TR length rather than rare expansions and include e.g. a coding polyhistidine motif in HRCT1 influencing risk of hypertension and a poly(CGC) in the 5'UTR of GNB2 influencing heart rate. Causal TRs were strongly enriched for associations with local gene expression and DNA methylation. Our study highlights the contribution of multi-allelic TRs to the "missing heritability" of the human genome.

9.
medRxiv ; 2023 Dec 12.
Artículo en Inglés | MEDLINE | ID: mdl-37205357

RESUMEN

GC-rich tandem repeat expansions (TREs) are often associated with DNA methylation, gene silencing and folate-sensitive fragile sites and underlie several congenital and late-onset disorders. Through a combination of DNA methylation profiling and tandem repeat genotyping, we identified 24 methylated TREs and investigated their effects on human traits using PheWAS in 168,641 individuals from the UK Biobank, identifying 156 significant TRE:trait associations involving 17 different TREs. Of these, a GCC expansion in the promoter of AFF3 was linked with a 2.4-fold reduced probability of completing secondary education, an effect size comparable to several recurrent pathogenic microdeletions. In a cohort of 6,371 probands with neurodevelopmental problems of suspected genetic etiology, we observed a significant enrichment of AFF3 expansions compared to controls. With a population prevalence that is at least 5-fold higher than the TRE that causes fragile X syndrome, AFF3 expansions represent a significant cause of neurodevelopmental delay.

10.
medRxiv ; 2023 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-37461547

RESUMEN

Repeat expansion disorders (REDs) are a devastating group of predominantly neurological diseases. Together they are common, affecting 1 in 3,000 people worldwide with population-specific differences. However, prevalence estimates of REDs are hampered by heterogeneous clinical presentation, variable geographic distributions, and technological limitations leading to under-ascertainment. Here, leveraging whole genome sequencing data from 82,176 individuals from different populations we found an overall carrier frequency of REDs of 1 in 340 individuals. Modelling disease prevalence using genetic data, age at onset and survival, we show that REDs are up to 3-fold more prevalent than currently reported figures. While some REDs are population-specific, e.g. Huntington's disease type 2, most REDs are represented in all broad genetic ancestries, including Africans and Asians, challenging the notion that some REDs are found only in European populations. These results have worldwide implications for local and global health communities in the diagnosis and management of REDs both at local and global levels.

11.
NPJ Parkinsons Dis ; 9(1): 33, 2023 Mar 04.
Artículo en Inglés | MEDLINE | ID: mdl-36871034

RESUMEN

Open science and collaboration are necessary to facilitate the advancement of Parkinson's disease (PD) research. Hackathons are collaborative events that bring together people with different skill sets and backgrounds to generate resources and creative solutions to problems. These events can be used as training and networking opportunities, thus we coordinated a virtual 3-day hackathon event, during which 49 early-career scientists from 12 countries built tools and pipelines with a focus on PD. Resources were created with the goal of helping scientists accelerate their own research by having access to the necessary code and tools. Each team was allocated one of nine different projects, each with a different goal. These included developing post-genome-wide association studies (GWAS) analysis pipelines, downstream analysis of genetic variation pipelines, and various visualization tools. Hackathons are a valuable approach to inspire creative thinking, supplement training in data science, and foster collaborative scientific relationships, which are foundational practices for early-career researchers. The resources generated can be used to accelerate research on the genetics of PD.

12.
Genome Med ; 14(1): 84, 2022 08 11.
Artículo en Inglés | MEDLINE | ID: mdl-35948990

RESUMEN

BACKGROUND: Expansions of short tandem repeats are the cause of many neurogenetic disorders including familial amyotrophic lateral sclerosis, Huntington disease, and many others. Multiple methods have been recently developed that can identify repeat expansions in whole genome or exome sequencing data. Despite the widely recognized need for visual assessment of variant calls in clinical settings, current computational tools lack the ability to produce such visualizations for repeat expansions. Expanded repeats are difficult to visualize because they correspond to large insertions relative to the reference genome and involve many misaligning and ambiguously aligning reads. RESULTS: We implemented REViewer, a computational method for visualization of sequencing data in genomic regions containing long repeat expansions and FlipBook, a companion image viewer designed for manual curation of large collections of REViewer images. To generate a read pileup, REViewer reconstructs local haplotype sequences and distributes reads to these haplotypes in a way that is most consistent with the fragment lengths and evenness of read coverage. To create appropriate training materials for onboarding new users, we performed a concordance study involving 12 scientists involved in short tandem repeat research. We used the results of this study to create a user guide that describes the basic principles of using REViewer as well as a guide to the typical features of read pileups that correspond to low confidence repeat genotype calls. Additionally, we demonstrated that REViewer can be used to annotate clinically relevant repeat interruptions by comparing visual assessment results of 44 FMR1 repeat alleles with the results of triplet repeat primed PCR. For 38 of these alleles, the results of visual assessment were consistent with triplet repeat primed PCR. CONCLUSIONS: Read pileup plots generated by REViewer offer an intuitive way to visualize sequencing data in regions containing long repeat expansions. Laboratories can use REViewer and FlipBook to assess the quality of repeat genotype calls as well as to visually detect interruptions or other imperfections in the repeat sequence and the surrounding flanking regions. REViewer and FlipBook are available under open-source licenses at https://github.com/illumina/REViewer and https://github.com/broadinstitute/flipbook respectively.


Asunto(s)
Esclerosis Amiotrófica Lateral , Secuencias Repetidas en Tándem , Alelos , Esclerosis Amiotrófica Lateral/genética , Exoma , Proteína de la Discapacidad Intelectual del Síndrome del Cromosoma X Frágil/genética , Haplotipos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos
13.
Nat Commun ; 9(1): 2064, 2018 05 25.
Artículo en Inglés | MEDLINE | ID: mdl-29802345

RESUMEN

Certain human traits such as neurodevelopmental disorders (NDs) and congenital anomalies (CAs) are believed to be primarily genetic in origin. However, even after whole-genome sequencing (WGS), a substantial fraction of such disorders remain unexplained. We hypothesize that some cases of ND-CA are caused by aberrant DNA methylation leading to dysregulated genome function. Comparing DNA methylation profiles from 489 individuals with ND-CAs against 1534 controls, we identify epivariations as a frequent occurrence in the human genome. De novo epivariations are significantly enriched in cases, while RNAseq analysis shows that epivariations often have an impact on gene expression comparable to loss-of-function mutations. Additionally, we detect and replicate an enrichment of rare sequence mutations overlapping CTCF binding sites close to epivariations, providing a rationale for interpreting non-coding variation. We propose that epivariations contribute to the pathogenesis of some patients with unexplained ND-CAs, and as such likely have diagnostic relevance.


Asunto(s)
Anomalías Congénitas/genética , Epigénesis Genética , Genoma Humano/genética , Trastornos del Neurodesarrollo/genética , Adolescente , Adulto , Estudios de Casos y Controles , Niño , Preescolar , Estudios de Cohortes , Metilación de ADN/genética , Conjuntos de Datos como Asunto , Epigenómica/métodos , Humanos , Lactante , Recién Nacido , Mutación con Pérdida de Función/genética , Masculino , Persona de Mediana Edad , Análisis de Secuencia de ADN , Análisis de Secuencia de ARN , Adulto Joven
15.
Nat Commun ; 8: 14428, 2017 02 14.
Artículo en Inglés | MEDLINE | ID: mdl-28195173

RESUMEN

The recent identification of progenitor populations that contribute to the developing heart in a distinct spatial and temporal manner has fundamentally improved our understanding of cardiac development. However, the mechanisms that direct atrial versus ventricular specification remain largely unknown. Here we report the identification of a progenitor population that gives rise primarily to cardiovascular cells of the ventricles and only to few atrial cells (<5%) of the differentiated heart. These progenitors are specified during gastrulation, when they transiently express Foxa2, a gene not previously implicated in cardiac development. Importantly, Foxa2+ cells contribute to previously identified progenitor populations in a defined pattern and ratio. Lastly, we describe an analogous Foxa2+ population during differentiation of embryonic stem cells. Together, these findings provide insight into the developmental origin of ventricular and atrial cells, and may lead to the establishment of new strategies for generating chamber-specific cell types from pluripotent stem cells.


Asunto(s)
Diferenciación Celular/fisiología , Ventrículos Cardíacos/citología , Ventrículos Cardíacos/crecimiento & desarrollo , Factor Nuclear 3-beta del Hepatocito/metabolismo , Animales , Línea Celular , Desarrollo Embrionario/fisiología , Femenino , Gastrulación/fisiología , Regulación del Desarrollo de la Expresión Génica , Atrios Cardíacos/citología , Atrios Cardíacos/diagnóstico por imagen , Atrios Cardíacos/crecimiento & desarrollo , Atrios Cardíacos/metabolismo , Ventrículos Cardíacos/diagnóstico por imagen , Factor Nuclear 3-beta del Hepatocito/genética , Mesodermo/citología , Mesodermo/crecimiento & desarrollo , Mesodermo/metabolismo , Ratones , Ratones Endogámicos C57BL , Células Madre Embrionarias de Ratones/citología , Células Madre Embrionarias de Ratones/metabolismo , Células Madre Pluripotentes/citología , Células Madre Pluripotentes/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA