RESUMEN
Despite significant progress in annotating the genome with experimental methods, much of the regulatory noncoding genome remains poorly defined. Here we assert that regulatory elements may be characterized by leveraging local epigenomic signatures where specific transcription factors (TFs) are bound. To link these two features, we introduce IMPACT, a genome annotation strategy that identifies regulatory elements defined by cell-state-specific TF binding profiles, learned from 515 chromatin and sequence annotations. We validate IMPACT using multiple compelling applications. First, IMPACT distinguishes between bound and unbound TF motif sites with high accuracy (average AUPRC 0.81, SE 0.07; across 8 tested TFs) and outperforms state-of-the-art TF binding prediction methods, MocapG, MocapS, and Virtual ChIP-seq. Second, in eight tested cell types, RNA polymerase II IMPACT annotations capture more cis-eQTL variation than sequence-based annotations, such as promoters and TSS windows (25% average increase in enrichment). Third, integration with rheumatoid arthritis (RA) summary statistics from European (N = 38,242) and East Asian (N = 22,515) populations revealed that the top 5% of CD4+ Treg IMPACT regulatory elements capture 85.7% of RA h2, the most comprehensive explanation for RA h2 to date. In comparison, the average RA h2 captured by compared CD4+ T histone marks is 42.3% and by CD4+ T specifically expressed gene sets is 36.4%. Lastly, we find that IMPACT may be used in many different cell types to identify complex trait associated regulatory elements.
Asunto(s)
Artritis Reumatoide/metabolismo , Epigenoma , Epigenómica/métodos , Genoma Humano , Anotación de Secuencia Molecular , Secuencias Reguladoras de Ácidos Nucleicos , Factores de Transcripción/metabolismo , Artritis Reumatoide/genética , Cromatina/genética , Cromatina/metabolismo , Biología Computacional/métodos , Histonas/genética , Histonas/metabolismo , Humanos , Regiones Promotoras Genéticas , Unión Proteica , Factores de Transcripción/genéticaRESUMEN
Candida bloodstream infection, i.e. candidemia, is the most frequently encountered life-threatening fungal infection worldwide, with mortality rates up to almost 50%. In the majority of candidemia cases, Candida albicans is responsible. Worryingly, a global increase in the number of patients who are susceptible to infection (e.g. immunocompromised patients), has led to a rise in the incidence of candidemia in the last few decades. Therefore, a better understanding of the anti-Candida host response is essential to overcome this poor prognosis and to lower disease incidence. Here, we integrated genome-wide association studies with bulk and single-cell transcriptomic analyses of immune cells stimulated with Candida albicans to further our understanding of the anti-Candida host response. We show that differential expression analysis upon Candida stimulation in single-cell expression data can reveal the important cell types involved in the host response against Candida. This confirmed the known major role of monocytes, but more interestingly, also uncovered an important role for NK cells. Moreover, combining the power of bulk RNA-seq with the high resolution of single-cell RNA-seq data led to the identification of 27 Candida-response QTLs and revealed the cell types potentially involved herein. Integration of these response QTLs with a GWAS on candidemia susceptibility uncovered a potential new role for LY86 in candidemia susceptibility. Finally, experimental follow-up confirmed that LY86 knockdown results in reduced monocyte migration towards the chemokine MCP-1, thereby implying that this reduced migration may underlie the increased susceptibility to candidemia. Altogether, our integrative systems genetics approach identifies previously unknown mechanisms underlying the immune response to Candida infection.
Asunto(s)
Antígenos de Superficie/genética , Antígenos de Superficie/inmunología , Candida albicans/fisiología , Candidiasis/genética , Candida albicans/inmunología , Candidemia/genética , Candidemia/inmunología , Candidemia/microbiología , Candidiasis/inmunología , Candidiasis/microbiología , Estudios de Cohortes , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Células Asesinas Naturales , Análisis de Secuencia de ARN , Análisis de la Célula IndividualRESUMEN
BACKGROUND: Aging is a multifactorial process that affects multiple tissues and is characterized by changes in homeostasis over time, leading to increased morbidity. Whole blood gene expression signatures have been associated with aging and have been used to gain information on its biological mechanisms, which are still not fully understood. However, blood is composed of many cell types whose proportions in blood vary with age. As a result, previously observed associations between gene expression levels and aging might be driven by cell type composition rather than intracellular aging mechanisms. To overcome this, previous aging studies already accounted for major cell types, but the possibility that the reported associations are false positives driven by less prevalent cell subtypes remains. RESULTS: Here, we compared the regression model from our previous work to an extended model that corrects for 33 additional white blood cell subtypes. Both models were applied to whole blood gene expression data from 3165 individuals belonging to the general population (age range of 18-81 years). We evaluated that the new model is a better fit for the data and it identified fewer genes associated with aging (625, compared to the 2808 of the initial model; P ≤ 2.5⨯10-6). Moreover, 511 genes (~ 18% of the 2808 genes identified by the initial model) were found using both models, indicating that the other previously reported genes could be proxies for less abundant cell types. In particular, functional enrichment of the genes identified by the new model highlighted pathways and GO terms specifically associated with platelet activity. CONCLUSIONS: We conclude that gene expression analyses in blood strongly benefit from correction for both common and rare blood cell types, and recommend using blood-cell count estimates as standard covariates when studying whole blood gene expression.
Asunto(s)
Envejecimiento , Transcriptoma , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Envejecimiento/genética , Humanos , Persona de Mediana Edad , Adulto JovenRESUMEN
Epistasis is the phenomenon whereby one polymorphism's effect on a trait depends on other polymorphisms present in the genome. The extent to which epistasis influences complex traits and contributes to their variation is a fundamental question in evolution and human genetics. Although often demonstrated in artificial gene manipulation studies in model organisms, and some examples have been reported in other species, few examples exist for epistasis among natural polymorphisms in human traits. Its absence from empirical findings may simply be due to low incidence in the genetic control of complex traits, but an alternative view is that it has previously been too technically challenging to detect owing to statistical and computational issues. Here we show, using advanced computation and a gene expression study design, that many instances of epistasis are found between common single nucleotide polymorphisms (SNPs). In a cohort of 846 individuals with 7,339 gene expression levels measured in peripheral blood, we found 501 significant pairwise interactions between common SNPs influencing the expression of 238 genes (P < 2.91 × 10(-16)). Replication of these interactions in two independent data sets showed both concordance of direction of epistatic effects (P = 5.56 × 10(-31)) and enrichment of interaction P values, with 30 being significant at a conservative threshold of P < 9.98 × 10(-5). Forty-four of the genetic interactions are located within 5 megabases of regions of known physical chromosome interactions (P = 1.8 × 10(-10)). Epistatic networks of three SNPs or more influence the expression levels of 129 genes, whereby one cis-acting SNP is modulated by several trans-acting SNPs. For example, MBNL1 is influenced by an additive effect at rs13069559, which itself is masked by trans-SNPs on 14 different chromosomes, with nearly identical genotype-phenotype maps for each cis-trans interaction. This study presents the first evidence, to our knowledge, for many instances of segregating common polymorphisms interacting to influence human traits.
Asunto(s)
Epistasis Genética/genética , Regulación de la Expresión Génica/genética , Transcripción Genética/genética , Estudios de Cohortes , Europa (Continente)/etnología , Femenino , Perfilación de la Expresión Génica , Estudios de Asociación Genética , Humanos , Desequilibrio de Ligamiento , Masculino , Linaje , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo , Reproducibilidad de los ResultadosRESUMEN
A major challenge in human genetics is to devise a systematic strategy to integrate disease-associated variants with diverse genomic and biological data sets to provide insight into disease pathogenesis and guide drug discovery for complex traits such as rheumatoid arthritis (RA). Here we performed a genome-wide association study meta-analysis in a total of >100,000 subjects of European and Asian ancestries (29,880 RA cases and 73,758 controls), by evaluating â¼10 million single-nucleotide polymorphisms. We discovered 42 novel RA risk loci at a genome-wide level of significance, bringing the total to 101 (refs 2 - 4). We devised an in silico pipeline using established bioinformatics methods based on functional annotation, cis-acting expression quantitative trait loci and pathway analyses--as well as novel methods based on genetic overlap with human primary immunodeficiency, haematological cancer somatic mutations and knockout mouse phenotypes--to identify 98 biological candidate genes at these 101 risk loci. We demonstrate that these genes are the targets of approved therapies for RA, and further suggest that drugs approved for other indications may be repurposed for the treatment of RA. Together, this comprehensive genetic study sheds light on fundamental genes, pathways and cell types that contribute to RA pathogenesis, and provides empirical evidence that the genetics of RA can provide important information for drug discovery.
Asunto(s)
Artritis Reumatoide/tratamiento farmacológico , Artritis Reumatoide/genética , Descubrimiento de Drogas , Predisposición Genética a la Enfermedad/genética , Terapia Molecular Dirigida , Alelos , Animales , Artritis Reumatoide/metabolismo , Artritis Reumatoide/patología , Pueblo Asiatico/genética , Estudios de Casos y Controles , Biología Computacional , Reposicionamiento de Medicamentos , Femenino , Estudio de Asociación del Genoma Completo , Neoplasias Hematológicas/genética , Neoplasias Hematológicas/metabolismo , Humanos , Masculino , Ratones , Ratones Noqueados , Polimorfismo de Nucleótido Simple/genética , Población Blanca/genéticaRESUMEN
Inappropriate activation or inadequate regulation of CD4+ and CD8+ T cells may contribute to the initiation and progression of multiple autoimmune and inflammatory diseases. Studies on disease-associated genetic polymorphisms have highlighted the importance of biological context for many regulatory variants, which is particularly relevant in understanding the genetic regulation of the immune system and its cellular phenotypes. Here we show cell type-specific regulation of transcript levels of genes associated with several autoimmune diseases in CD4+ and CD8+ T cells including a trans-acting regulatory locus at chr12q13.2 containing the rs1131017 SNP in the RPS26 gene. Most remarkably, we identify a common missense variant in IL27, associated with type 1 diabetes that results in decreased functional activity of the protein and reduced expression levels of downstream IRF1 and STAT1 in CD4+ T cells only. Altogether, our results indicate that eQTL mapping in purified T cells provides novel functional insights into polymorphisms and pathways associated with autoimmune diseases.
Asunto(s)
Enfermedades Autoinmunes/genética , Linfocitos T CD4-Positivos/metabolismo , Linfocitos T CD8-positivos/metabolismo , Sitios de Carácter Cuantitativo/genética , Linfocitos T CD4-Positivos/inmunología , Linfocitos T CD8-positivos/inmunología , Mapeo Cromosómico/métodos , Diabetes Mellitus Tipo 1/genética , Regulación de la Expresión Génica , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Células HEK293 , Humanos , Factor 1 Regulador del Interferón/genética , Interleucina-27/genética , Mutación , Polimorfismo de Nucleótido Simple , Proteínas Ribosómicas/genética , Factor de Transcripción STAT1/genéticaRESUMEN
[This corrects the article DOI: 10.1371/journal.pgen.1005378.].
RESUMEN
Identifying genomic annotations that differentiate causal from trait-associated variants is essential to fine mapping disease loci. Although many studies have identified non-coding functional annotations that overlap disease-associated variants, these annotations often colocalize, complicating the ability to use these annotations for fine mapping causal variation. We developed a statistical approach (Genomic Annotation Shifter [GoShifter]) to assess whether enriched annotations are able to prioritize causal variation. GoShifter defines the null distribution of an annotation overlapping an allele by locally shifting annotations; this approach is less sensitive to biases arising from local genomic structure than commonly used enrichment methods that depend on SNP matching. Local shifting also allows GoShifter to identify independent causal effects from colocalizing annotations. Using GoShifter, we confirmed that variants in expression quantitative trail loci drive gene-expression changes though DNase-I hypersensitive sites (DHSs) near transcription start sites and independently through 3' UTR regulation. We also showed that (1) 15%-36% of trait-associated loci map to DHSs independently of other annotations; (2) loci associated with breast cancer and rheumatoid arthritis harbor potentially causal variants near the summits of histone marks rather than full peak bodies; (3) variants associated with height are highly enriched in embryonic stem cell DHSs; and (4) we can effectively prioritize causal variation at specific loci.
Asunto(s)
Regulación de la Expresión Génica/genética , Variación Genética , Genoma Humano/genética , Anotación de Secuencia Molecular/métodos , Sitios de Carácter Cuantitativo/genética , Artritis Reumatoide/genética , Neoplasias de la Mama/genética , Histonas/genética , Histonas/metabolismo , HumanosRESUMEN
The functional consequences of trait associated SNPs are often investigated using expression quantitative trait locus (eQTL) mapping. While trait-associated variants may operate in a cell-type specific manner, eQTL datasets for such cell-types may not always be available. We performed a genome-environment interaction (GxE) meta-analysis on data from 5,683 samples to infer the cell type specificity of whole blood cis-eQTLs. We demonstrate that this method is able to predict neutrophil and lymphocyte specific cis-eQTLs and replicate these predictions in independent cell-type specific datasets. Finally, we show that SNPs associated with Crohn's disease preferentially affect gene expression within neutrophils, including the archetypal NOD2 locus.
Asunto(s)
Linfocitos/citología , Neutrófilos/citología , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Línea Celular , Enfermedad de Crohn/genética , Regulación de la Expresión Génica , Estudio de Asociación del Genoma Completo/métodos , Humanos , Linfocitos/metabolismo , Neutrófilos/metabolismo , Proteína Adaptadora de Señalización NOD2/genética , Proteína Adaptadora de Señalización NOD2/metabolismo , Fenotipo , Análisis de Componente Principal , Reproducibilidad de los ResultadosRESUMEN
Genome-wide association studies (GWAS) have identified more than 100 genetic variants contributing to BMI, a measure of body size, or waist-to-hip ratio (adjusted for BMI, WHRadjBMI), a measure of body shape. Body size and shape change as people grow older and these changes differ substantially between men and women. To systematically screen for age- and/or sex-specific effects of genetic variants on BMI and WHRadjBMI, we performed meta-analyses of 114 studies (up to 320,485 individuals of European descent) with genome-wide chip and/or Metabochip data by the Genetic Investigation of Anthropometric Traits (GIANT) Consortium. Each study tested the association of up to ~2.8M SNPs with BMI and WHRadjBMI in four strata (men ≤50y, men >50y, women ≤50y, women >50y) and summary statistics were combined in stratum-specific meta-analyses. We then screened for variants that showed age-specific effects (G x AGE), sex-specific effects (G x SEX) or age-specific effects that differed between men and women (G x AGE x SEX). For BMI, we identified 15 loci (11 previously established for main effects, four novel) that showed significant (FDR<5%) age-specific effects, of which 11 had larger effects in younger (<50y) than in older adults (≥50y). No sex-dependent effects were identified for BMI. For WHRadjBMI, we identified 44 loci (27 previously established for main effects, 17 novel) with sex-specific effects, of which 28 showed larger effects in women than in men, five showed larger effects in men than in women, and 11 showed opposite effects between sexes. No age-dependent effects were identified for WHRadjBMI. This is the first genome-wide interaction meta-analysis to report convincing evidence of age-dependent genetic effects on BMI. In addition, we confirm the sex-specificity of genetic effects on WHRadjBMI. These results may provide further insights into the biology that underlies weight change with age or the sexually dimorphism of body shape.
Asunto(s)
Índice de Masa Corporal , Tamaño Corporal/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Adulto , Factores de Edad , Anciano , Mapeo Cromosómico , Femenino , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Caracteres Sexuales , Relación Cintura-Cadera , Población BlancaRESUMEN
Although genome-wide association studies and fine mapping have identified 39 non-HLA loci associated with celiac disease (CD), it is difficult to pinpoint the functional variants and susceptibility genes in these loci. We applied integrative approaches to annotate and prioritize functional single nucleotide polymorphisms (SNPs), genes and pathways affected in CD. CD-associated SNPs were intersected with regulatory elements categorized by the ENCODE project to prioritize functional variants, while results from cis-expression quantitative trait loci (eQTL) mapping in 1469 blood samples were combined with co-expression analyses to prioritize causative genes. To identify the key cell types involved in CD, we performed pathway analysis on RNA-sequencing data from different immune cell populations and on publicly available expression data on non-immune tissues. We discovered that CD SNPs are significantly enriched in B-cell-specific enhancer regions, suggesting that, besides T-cell processes, B-cell responses play a major role in CD. By combining eQTL and co-expression analyses, we prioritized 43 susceptibility genes in 36 loci. Pathway and tissue-specific expression analyses on these genes suggested enrichment of CD genes in the Th1, Th2 and Th17 pathways, but also predicted a role for four genes in the intestinal barrier function. We also discovered an intricate transcriptional connectivity between CD susceptibility genes and interferon-γ, a key effector in CD, despite the absence of CD-associated SNPs in the IFNG locus. Using systems biology, we prioritized the CD-associated functional SNPs and genes. By highlighting a role for B cells in CD, which classically has been described as a T-cell-driven disease, we offer new insights into the mechanisms and pathways underlying CD.
Asunto(s)
Enfermedad Celíaca/genética , Interferón gamma/metabolismo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Enfermedad Celíaca/metabolismo , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Interferón gamma/genética , Anotación de Secuencia MolecularRESUMEN
A large fraction of human genes are regulated by genetic variation near the transcribed sequence (cis-eQTL, expression quantitative trait locus), and many cis-eQTLs have implications for human disease. Less is known regarding the effects of genetic variation on expression of distant genes (trans-eQTLs) and their biological mechanisms. In this work, we use genome-wide data on SNPs and array-based expression measures from mononuclear cells obtained from a population-based cohort of 1,799 Bangladeshi individuals to characterize cis- and trans-eQTLs and determine if observed trans-eQTL associations are mediated by expression of transcripts in cis with the SNPs showing trans-association, using Sobel tests of mediation. We observed 434 independent trans-eQTL associations at a false-discovery rate of 0.05, and 189 of these trans-eQTLs were also cis-eQTLs (enrichment P<0.0001). Among these 189 trans-eQTL associations, 39 were significantly attenuated after adjusting for a cis-mediator based on Sobel P<10-5. We attempted to replicate 21 of these mediation signals in two European cohorts, and while only 7 trans-eQTL associations were present in one or both cohorts, 6 showed evidence of cis-mediation. Analyses of simulated data show that complete mediation will be observed as partial mediation in the presence of mediator measurement error or imperfect LD between measured and causal variants. Our data demonstrates that trans-associations can become significantly stronger or switch directions after adjusting for a potential mediator. Using simulated data, we demonstrate that this phenomenon is expected in the presence of strong cis-trans confounding and when the measured cis-transcript is correlated with the true (unmeasured) mediator. In conclusion, by applying mediation analysis to eQTL data, we show that a substantial fraction of observed trans-eQTL associations can be explained by cis-mediation. Future studies should focus on understanding the mechanisms underlying widespread cis-mediation and their relevance to disease biology, as well as using mediation analysis to improve eQTL discovery.
Asunto(s)
Pueblo Asiatico/genética , Regulación de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Asia/epidemiología , Pueblo Asiatico/estadística & datos numéricos , Bangladesh/epidemiología , Quimioprevención , Simulación por Computador , Perfilación de la Expresión Génica , Variación Genética , Humanos , Selenio/uso terapéutico , Neoplasias Cutáneas/epidemiología , Neoplasias Cutáneas/genética , Neoplasias Cutáneas/prevención & control , Vitamina E/uso terapéuticoRESUMEN
Using the Immunochip for genotyping, we identified 39 non-human leukocyte antigen (non-HLA) loci associated to celiac disease (CeD), an immune-mediated disease with a worldwide frequency of â¼1%. The most significant non-HLA signal mapped to the intronic region of 70 kb in the LPP gene. Our aim was to fine map and identify possible functional variants in the LPP locus. We performed a meta-analysis in a cohort of 25 169 individuals from six different populations previously genotyped using Immunochip. Imputation using data from the Genome of the Netherlands and 1000 Genomes projects, followed by meta-analysis, confirmed the strong association signal on the LPP locus (rs2030519, P = 1.79 × 10(-49)), without any novel associations. The conditional analysis on this top SNP-indicated association to a single common haplotype. By performing haplotype analyses in each population separately, as well as in a combined group of the four populations that reach the significant threshold after correction (P < 0.008), we narrowed down the CeD-associated region from 70 to 2.8 kb (P = 1.35 × 10(-44)). By intersecting regulatory data from the ENCODE project, we found a functional SNP, rs4686484 (P = 3.12 × 10(-49)), that maps to several B-cell enhancer elements and a highly conserved region. This SNP was also predicted to change the binding motif of the transcription factors IRF4, IRF11, Nkx2.7 and Nkx2.9, suggesting its role in transcriptional regulation. We later found significantly low levels of LPP mRNA in CeD biopsies compared with controls, thus our results suggest that rs4686484 is the functional variant in this locus, while LPP expression is decreased in CeD.
Asunto(s)
Enfermedad Celíaca/genética , Proteínas del Citoesqueleto/genética , Proteínas con Dominio LIM/genética , Polimorfismo de Nucleótido Simple/genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Haplotipos , Humanos , Factores Reguladores del Interferón/genética , Desequilibrio de Ligamiento , Factores de Transcripción/genéticaRESUMEN
Recently it has become clear that only a small percentage (7%) of disease-associated single nucleotide polymorphisms (SNPs) are located in protein-coding regions, while the remaining 93% are located in gene regulatory regions or in intergenic regions. Thus, the understanding of how genetic variations control the expression of non-coding RNAs (in a tissue-dependent manner) has far-reaching implications. We tested the association of SNPs with expression levels (eQTLs) of large intergenic non-coding RNAs (lincRNAs), using genome-wide gene expression and genotype data from five different tissues. We identified 112 cis-regulated lincRNAs, of which 45% could be replicated in an independent dataset. We observed that 75% of the SNPs affecting lincRNA expression (lincRNA cis-eQTLs) were specific to lincRNA alone and did not affect the expression of neighboring protein-coding genes. We show that this specific genotype-lincRNA expression correlation is tissue-dependent and that many of these lincRNA cis-eQTL SNPs are also associated with complex traits and diseases.
Asunto(s)
Regulación de la Expresión Génica , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo/genética , ARN Largo no Codificante , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , ARN Largo no Codificante/genética , ARN Largo no Codificante/metabolismo , Distribución TisularRESUMEN
Many disease-associated variants affect gene expression levels (expression quantitative trait loci, eQTLs) and expression profiling using next generation sequencing (NGS) technology is a powerful way to detect these eQTLs. We analyzed 94 total blood samples from healthy volunteers with DeepSAGE to gain specific insight into how genetic variants affect the expression of genes and lengths of 3'-untranslated regions (3'-UTRs). We detected previously unknown cis-eQTL effects for GWAS hits in disease- and physiology-associated traits. Apart from cis-eQTLs that are typically easily identifiable using microarrays or RNA-sequencing, DeepSAGE also revealed many cis-eQTLs for antisense and other non-coding transcripts, often in genomic regions containing retrotransposon-derived elements. We also identified and confirmed SNPs that affect the usage of alternative polyadenylation sites, thereby potentially influencing the stability of messenger RNAs (mRNA). We then combined the power of RNA-sequencing with DeepSAGE by performing a meta-analysis of three datasets, leading to the identification of many more cis-eQTLs. Our results indicate that DeepSAGE data is useful for eQTL mapping of known and unknown transcripts, and for identifying SNPs that affect alternative polyadenylation. Because of the inherent differences between DeepSAGE and RNA-sequencing, our complementary, integrative approach leads to greater insight into the molecular consequences of many disease-associated variants.
Asunto(s)
Regulación de la Expresión Génica/genética , Poliadenilación/genética , Sitios de Carácter Cuantitativo/genética , Retroelementos/genética , Regiones no Traducidas 3'/genética , Estudio de Asociación del Genoma Completo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Polimorfismo de Nucleótido SimpleRESUMEN
BACKGROUND: Brain-derived neurotrophic factor (BDNF) is a secretory protein that has been implicated in the pathogenesis of allergic rhinitis (AR), atopic asthma, and eczema, but it is currently unknown whether BDNF polymorphisms influence susceptibility to moderate-to-severe AR. OBJECTIVE: We sought to identify disease associations and the functional effect of BDNF genetic variants in patients with moderate-to-severe AR. METHODS: Tagging single nucleotide polymorphisms (SNPs) spanning the BDNF gene were selected from the human HapMap Han Chinese from Beijing (CHB) data set, and associations with moderate-to-severe AR were assessed in 2 independent cohorts of Chinese patients (2216 from Shandong province and 1239 living in Singapore). The functional effects of the BDNF genetic variants were determined by using both in vitro and ex vivo assays. RESULTS: The tagging SNP rs10767664 was significantly associated with the risk of moderate-to-severe AR in both Singapore Chinese (P = .0017; odds ratio, 1.324) and Shandong Chinese populations (P = .039; odds ratio, 1.180). The coding nonsynonymous SNP rs6265 was in perfect linkage with rs10767664 and conferred increased BDNF protein secretion by a human cell line in vitro. Subjects bearing the AA genotype of rs10767664 exhibited increased risk of moderate-to-severe AR and displayed increased BDNF protein and total IgE levels in plasma. Using a large-scale expression quantitative trait locus study, we demonstrated that BDNF SNPs are significantly associated with altered BDNF concentrations in peripheral blood. CONCLUSION: A common genetic variant of the BDNF gene is associated with increased risk of moderate-to-severe AR, and the AA genotype is associated with increased BDNF mRNA levels in peripheral blood. Together, these data indicate that functional BDNF gene variants increase the risk of moderate-to-severe AR.
Asunto(s)
Factor Neurotrófico Derivado del Encéfalo/genética , Predisposición Genética a la Enfermedad , Inmunoglobulina E/sangre , Polimorfismo de Nucleótido Simple , ARN Mensajero/genética , Rinitis Alérgica/genética , Adolescente , Adulto , Pueblo Asiatico , Factor Neurotrófico Derivado del Encéfalo/sangre , Niño , Preescolar , China , Estudios de Cohortes , Femenino , Expresión Génica , Humanos , Masculino , Persona de Mediana Edad , Oportunidad Relativa , Sitios de Carácter Cuantitativo , ARN Mensajero/sangre , Rinitis Alérgica/sangre , Rinitis Alérgica/etnología , Rinitis Alérgica/patología , Riesgo , Índice de Severidad de la Enfermedad , SingapurRESUMEN
Genome-wide association studies (GWASs) have shown a large number of genetic variants to be associated with complex diseases. The identification of the causal variant within an associated locus can sometimes be difficult because of the linkage disequilibrium between the associated variants and because most GWAS loci contain multiple genes, or no genes at all. Expression quantitative trait locus (eQTL) mapping is a method used to determine the effects of genetic variants on gene expression levels. eQTL mapping studies have enabled the prioritization of genetic variants within GWAS loci, and have shown that trait-associated single nucleotide polymorphisms (SNPs) often function in a tissue- or cell type-specific manner, sometimes having downstream effects on completely different chromosomes. Furthermore, recent RNA-sequencing (RNA-seq) studies have shown that a large repertoire of transcripts is available in cells, which are actively regulated by (trait-associated) variants. Future eQTL mapping studies will focus on broadening the range of available tissues and cell types, in order to determine the key tissues and cell types involved in complex traits. Finally, large meta-analyses will be able to pinpoint the causal variants within the trait-associated loci and determine their downstream effects in greater detail. This article is part of a Special Issue entitled: From Genome to Function.
RESUMEN
Many disorders are associated with altered serum protein concentrations, including malnutrition, cancer, and cardiovascular, kidney, and inflammatory diseases. Although these protein concentrations are highly heritable, relatively little is known about their underlying genetic determinants. Through transethnic meta-analysis of European-ancestry and Japanese genome-wide association studies, we identified six loci at genome-wide significance (p < 5 × 10(-8)) for serum albumin (HPN-SCN1B, GCKR-FNDC4, SERPINF2-WDR81, TNFRSF11A-ZCCHC2, FRMD5-WDR76, and RPS11-FCGRT, in up to 53,190 European-ancestry and 9,380 Japanese individuals) and three loci for total protein (TNFRS13B, 6q21.3, and ELL2, in up to 25,539 European-ancestry and 10,168 Japanese individuals). We observed little evidence of heterogeneity in allelic effects at these loci between groups of European and Japanese ancestry but obtained substantial improvements in the resolution of fine mapping of potential causal variants by leveraging transethnic differences in the distribution of linkage disequilibrium. We demonstrated a functional role for the most strongly associated serum albumin locus, HPN, for which Hpn knockout mice manifest low plasma albumin concentrations. Other loci associated with serum albumin harbor genes related to ribosome function, protein translation, and proteasomal degradation, whereas those associated with serum total protein include genes related to immune function. Our results highlight the advantages of transethnic meta-analysis for the discovery and fine mapping of complex trait loci and have provided initial insights into the underlying genetic architecture of serum protein concentrations and their association with human disease.