Search | Biblioteca Virtual en Salud

iFunMed: Integrative functional mediation analysis of GWAS and eQTL studies.

Rojo, Constanza; Zhang, Qi; Keles, Sündüz.

Genet Epidemiol ; 43(7): 742-760, 2019 Oct.

Article in English | MEDLINE | ID: mdl-31328826

ABSTRACT

Genome-wide association studies (GWAS) have successfully identified thousands of genetic variants contributing to disease and other phenotypes. However, significant obstacles hamper our ability to elucidate causal variants, identify genes affected by causal variants, and characterize the mechanisms by which genotypes influence phenotypes. The increasing availability of genome-wide functional annotation data is providing unique opportunities to incorporate prior information into the analysis of GWAS to better understand the impact of variants on disease etiology. Although there have been many advances in incorporating prior information into prioritization of trait-associated variants in GWAS, functional annotation data have played a secondary role in the joint analysis of GWAS and molecular (i.e., expression) quantitative trait loci (eQTL) data in assessing evidence for association. To address this, we develop a novel mediation framework, iFunMed, to integrate GWAS and eQTL data with the utilization of publicly available functional annotation data. iFunMed extends the scope of standard mediation analysis by incorporating information from multiple genetic variants at a time and leveraging variant-level summary statistics. Data-driven computational experiments convey how informative annotations improve single-nucleotide polymorphism (SNP) selection performance while emphasizing robustness of iFunMed to noninformative annotations. Application to Framingham Heart Study data indicates that iFunMed is able to boost detection of SNPs with mediation effects that can be attributed to regulatory mechanisms.

Subject(s)

Genome-Wide Association Study; Quantitative Trait Loci/genetics; Software; Base Sequence; Computer Simulation; Erythrocyte Count; Genotype; Humans; Molecular Sequence Annotation; Phenotype; Polymorphism, Single Nucleotide/genetics; Probability

Perm-seq: Mapping Protein-DNA Interactions in Segmental Duplication and Highly Repetitive Regions of Genomes with Prior-Enhanced Read Mapping.

Zeng, Xin; Li, Bo; Welch, Rene; Rojo, Constanza; Zheng, Ye; Dewey, Colin N; Keles, Sündüz.

PLoS Comput Biol ; 11(10): e1004491, 2015 Oct.

Article in English | MEDLINE | ID: mdl-26484757

ABSTRACT

Segmental duplications and other highly repetitive regions of genomes contribute significantly to cells' regulatory programs. Advances in next-generation sequencing have enabled genome-wide profiling of protein-DNA interactions by chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq). However, interactions in highly repetitive regions of genomes have proven difficult to map, since short reads of 50-100 base pairs (bps) from these regions map to multiple locations in reference genomes. Standard analytical methods discard such multi-mapping reads, and the few that can accommodate them are prone to high false positive and false negative rates. We developed Perm-seq, a prior-enhanced read allocation method for ChIP-seq experiments that can allocate multi-mapping reads in highly repetitive regions of genomes with high accuracy. We comprehensively evaluated Perm-seq and found that our prior-enhanced approach significantly improves multi-read allocation accuracy over approaches that do not utilize additional data types. The statistical formalism underlying our approach facilitates supervision of multi-read allocation with a variety of data sources, including histone ChIP-seq. We applied Perm-seq to 64 ENCODE ChIP-seq datasets from GM12878 and K562 cells and identified many novel protein-DNA interactions in segmental duplication regions. Our analysis reveals that although protein-DNA interaction sites are evolutionarily less conserved in repetitive regions, they share the overall sequence characteristics of protein-DNA interactions in non-repetitive regions.

Subject(s)

Chromosome Mapping/methods; DNA-Binding Proteins/genetics; DNA/genetics; Protein Interaction Mapping/methods; Repetitive Sequences, Nucleic Acid/genetics; Segmental Duplications, Genomic/genetics; Algorithms; Base Sequence; Chromatin Immunoprecipitation/methods; DNA/chemistry; DNA-Binding Proteins/chemistry; High-Throughput Nucleotide Sequencing/methods; Humans; K562 Cells; Molecular Sequence Data
