RESUMO
The Human Leukocyte Antigen (HLA) is a critical genetic system for different outcomes after solid organ and hematopoietic cell transplantation. Its polymorphism is usually determined by molecular technologies at the DNA level. A potential role of HLA allelic expression remains under investigation in the context of the allogenic immune response between donors and recipients. In this study, we quantified the allelic expression of all three HLA class I loci (HLA-A, B and C) by RNA sequencing and conducted an analysis of expression quantitative traits loci (eQTL) to investigate whether HLA expression regulation could be associated with non-coding gene variations. HLA-B alleles exhibited the highest expression levels followed by HLA-C and HLA-A alleles. The max fold expression variation was observed for HLA-C alleles. The expression of HLA class I loci of distinct individuals demonstrated a coordinated and paired expression of both alleles of the same locus. Expression of conserved HLA-A~B~C haplotypes differed in distinct PBMC's suggesting an individual regulated expression of both HLA class I alleles and haplotypes. Cytokines TNFα /IFNß, which induced a very similar upregulation of HLA class I RNA and cell surface expression across alleles did not modify the individually coordinated expression at the three HLA class I loci. By identifying cis eQTLs for the HLA class I genes, we show that the non-coding eQTLs explain 29%, 13%, and 31% of the respective HLA-A, B, C expression variance in unstimulated cells, and 9%, 23%, and 50% of the variance in cytokine-stimulated cells. The eQTLs have significantly higher effect sizes in stimulated cells compared to unstimulated cells for HLA-B and HLA-C genes expression. Our data also suggest that the identified eQTLs are independent from the coding variation which defines HLA alleles and thus may be influential on intra-allele expression variability although they might not represent the causal eQTLs.
Assuntos
Antígenos HLA-C , Leucócitos Mononucleares , Alelos , Frequência do Gene , Antígenos HLA , Antígenos HLA-A/genética , Antígenos HLA-B/genética , Antígenos HLA-C/genética , Haplótipos , Antígenos de Histocompatibilidade Classe I/genética , Antígenos de Histocompatibilidade Classe II/genética , HumanosRESUMO
BACKGROUND: Expression quantitative trait loci (eQTL) studies provide insights into regulatory mechanisms underlying disease risk. Expanding studies of gene regulation to underexplored populations and to medically relevant tissues offers potential to reveal yet unknown regulatory variants and to better understand disease mechanisms. Here, we performed eQTL mapping in subcutaneous (S) and visceral (V) adipose tissue from 106 Greek individuals (Greek Metabolic study, GM) and compared our findings to those from the Genotype-Tissue Expression (GTEx) resource. RESULTS: We identified 1,930 and 1,515 eGenes in S and V respectively, over 13% of which are not observed in GTEx adipose tissue, and that do not arise due to different ancestry. We report additional context-specific regulatory effects in genes of clinical interest (e.g. oncogene ST7) and in genes regulating responses to environmental stimuli (e.g. MIR21, SNX33). We suggest that a fraction of the reported differences across populations is due to environmental effects on gene expression, driving context-specific eQTLs, and suggest that environmental effects can determine the penetrance of disease variants thus shaping disease risk. We report that over half of GM eQTLs colocalize with GWAS SNPs and of these colocalizations 41% are not detected in GTEx. We also highlight the clinical relevance of S adipose tissue by revealing that inflammatory processes are upregulated in individuals with obesity, not only in V, but also in S tissue. CONCLUSIONS: By focusing on an understudied population, our results provide further candidate genes for investigation regarding their role in adipose tissue biology and their contribution to disease risk and pathogenesis.
Assuntos
Predisposição Genética para Doença , Locos de Características Quantitativas , Humanos , Grécia , Regulação da Expressão Gênica , Genótipo , Polimorfismo de Nucleotídeo Único , Estudo de Associação Genômica Ampla/métodosRESUMO
Schizophrenia is a polygenic psychiatric disorder with limited understanding about the mechanistic changes in gene expression regulation. To elucidate on this, we integrate interindividual variability of regulatory activity (ChIP-sequencing for H3K27ac histone mark) with gene expression and genotype data captured from the prefrontal cortex of 272 cases and controls. By measuring interindividual correlation among proximal chromatin peaks, we show that regulatory element activity is structured into 10,936 and 10,376 cis-regulatory domains in cases and controls, respectively. The schizophrenia-specific cis-regulatory domains are enriched for fetal-specific (p = 0.0014, OR = 1.52) and depleted of adult-specific regulatory activity (p = 3.04 × 10-50, OR = 0.57) and are enriched for SCZ heritability (p = 0.001). By studying the interplay among genetic variants, gene expression, and cis-regulatory domains, we ascertain that changes in coordinated regulatory activity tag alterations in gene expression levels (p = 3.43 × 10-5, OR = 1.65), unveil case-specific QTL effects, and identify regulatory machinery changes for genes affecting synaptic function and dendritic spine morphology in schizophrenia. Altogether, we show that accounting for coordinated regulatory activity provides a novel mechanistic approach to reduce the search space for unveiling genetically perturbed regulation of gene expression in schizophrenia.
Assuntos
Esquizofrenia , Adulto , Humanos , Esquizofrenia/genética , Regulação da Expressão Gênica , Córtex Pré-Frontal/metabolismo , Cromatina/metabolismo , Herança Multifatorial , Predisposição Genética para DoençaRESUMO
The cis-regulatory effects responsible for cancer development have not been as extensively studied as the perturbations of the protein coding genome in tumorigenesis. To better characterize colorectal cancer (CRC) development we conducted an RNA-sequencing experiment of 103 matched tumour and normal colon mucosa samples from Danish CRC patients, 90 of which were germline-genotyped. By investigating allele-specific expression (ASE) we show that the germline genotypes remain important determinants of allelic gene expression in tumours. Using the changes in ASE in matched pairs of samples we discover 71 genes with excess of somatic cis-regulatory effects in CRC, suggesting a cancer driver role. We correlate genotypes and gene expression to identify expression quantitative trait loci (eQTLs) and find 1,693 and 948 eQTLs in normal samples and tumours, respectively. We estimate that 36% of the tumour eQTLs are exclusive to CRC and show that this specificity is partially driven by increased expression of specific transcription factors and changes in methylation patterns. We show that tumour-specific eQTLs are more enriched for low CRC genome-wide association study (GWAS) P values than shared eQTLs, which suggests that some of the GWAS variants are tumour specific regulatory variants. Importantly, tumour-specific eQTL genes also accumulate more somatic mutations when compared to the shared eQTL genes, raising the possibility that they constitute germline-derived cancer regulatory drivers. Collectively the integration of genome and the transcriptome reveals a substantial number of putative somatic and germline cis-regulatory cancer changes that may have a role in tumorigenesis.
Assuntos
Neoplasias Colorretais/genética , Regulação Neoplásica da Expressão Gênica/genética , Sequências Reguladoras de Ácido Nucleico/genética , Alelos , Transformação Celular Neoplásica/genética , Transformação Celular Neoplásica/patologia , Neoplasias Colorretais/patologia , Metilação de DNA , Perfilação da Expressão Gênica , Genes Neoplásicos , Estudo de Associação Genômica Ampla , Genótipo , Mutação em Linhagem Germinativa/genética , Humanos , Mucosa Intestinal/citologia , Mucosa Intestinal/metabolismo , Mucosa Intestinal/patologia , Locos de Características Quantitativas/genética , Análise de Sequência de RNA , Fatores de Transcrição/metabolismo , Transcriptoma/genéticaRESUMO
OBJECTIVES: Systemic lupus erythematosus (SLE) diagnosis and treatment remain empirical and the molecular basis for its heterogeneity elusive. We explored the genomic basis for disease susceptibility and severity. METHODS: mRNA sequencing and genotyping in blood from 142 patients with SLE and 58 healthy volunteers. Abundances of cell types were assessed by CIBERSORT and cell-specific effects by interaction terms in linear models. Differentially expressed genes (DEGs) were used to train classifiers (linear discriminant analysis) of SLE versus healthy individuals in 80% of the dataset and were validated in the remaining 20% running 1000 iterations. Transcriptome/genotypes were integrated by expression-quantitative trail loci (eQTL) analysis; tissue-specific genetic causality was assessed by regulatory trait concordance (RTC). RESULTS: SLE has a 'susceptibility signature' present in patients in clinical remission, an 'activity signature' linked to genes that regulate immune cell metabolism, protein synthesis and proliferation, and a 'severity signature' best illustrated in active nephritis, enriched in druggable granulocyte and plasmablast/plasma-cell pathways. Patients with SLE have also perturbed mRNA splicing enriched in immune system and interferon signalling genes. A novel transcriptome index distinguished active versus inactive disease-but not low disease activity-and correlated with disease severity. DEGs discriminate SLE versus healthy individuals with median sensitivity 86% and specificity 92% suggesting a potential use in diagnostics. Combined eQTL analysis from the Genotype Tissue Expression (GTEx) project and SLE-associated genetic polymorphisms demonstrates that susceptibility variants may regulate gene expression in the blood but also in other tissues. CONCLUSION: Specific gene networks confer susceptibility to SLE, activity and severity, and may facilitate personalised care.
Assuntos
Perfilação da Expressão Gênica/métodos , Predisposição Genética para Doença/epidemiologia , Interferon Tipo I/genética , Lúpus Eritematoso Sistêmico/genética , Lúpus Eritematoso Sistêmico/imunologia , Adulto , Estudos de Casos e Controles , Progressão da Doença , Feminino , Variação Genética , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Masculino , Pessoa de Meia-Idade , Fenótipo , RNA Mensageiro/genética , Valores de Referência , Transcriptoma/genética , Adulto JovemRESUMO
Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of messenger RNA and microRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project--the first uniformly processed high-throughput RNA-sequencing data from multiple human populations with high-quality genome sequences. We discover extremely widespread genetic variation affecting the regulation of most genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on the cellular mechanisms of regulatory and loss-of-function variation, and allows us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.
Assuntos
Variação Genética/genética , Genoma Humano/genética , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de RNA , Transcriptoma/genética , Alelos , Linhagem Celular Transformada , Éxons/genética , Perfilação da Expressão Gênica , Humanos , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , RNA Mensageiro/análise , RNA Mensageiro/genéticaRESUMO
OBJECTIVE: To elucidate the genetic architecture of gene expression in pancreatic tissues. DESIGN: We performed expression quantitative trait locus (eQTL) analysis in histologically normal pancreatic tissue samples (n=95) using RNA sequencing and the corresponding 1000 genomes imputed germline genotypes. Data from pancreatic tumour-derived tissue samples (n=115) from The Cancer Genome Atlas were included for comparison. RESULTS: We identified 38 615 cis-eQTLs (in 484 genes) in histologically normal tissues and 39 713 cis-eQTL (in 237 genes) in tumour-derived tissues (false discovery rate <0.1), with the strongest effects seen near transcriptional start sites. Approximately 23% and 42% of genes with significant cis-eQTLs appeared to be specific for tumour-derived and normal-derived tissues, respectively. Significant enrichment of cis-eQTL variants was noted in non-coding regulatory regions, in particular for pancreatic tissues (1.53-fold to 3.12-fold, p≤0.0001), indicating tissue-specific functional relevance. A common pancreatic cancer risk locus on 9q34.2 (rs687289) was associated with ABO expression in histologically normal (p=5.8×10-8) and tumour-derived (p=8.3×10-5) tissues. The high linkage disequilibrium between this variant and the O blood group generating deletion variant in ABO (exon 6) suggested that nonsense-mediated decay (NMD) of the 'O' mRNA might explain this finding. However, knockdown of crucial NMD regulators did not influence decay of the ABO 'O' mRNA, indicating that a gene regulatory element influenced by pancreatic cancer risk alleles may underlie the eQTL. CONCLUSIONS: We have identified cis-eQTLs representing potential functional regulatory variants in the pancreas and generated a rich data set for further studies on gene expression and its regulation in pancreatic tissues.
Assuntos
Sistema ABO de Grupos Sanguíneos/genética , Expressão Gênica , Pâncreas , Neoplasias Pancreáticas/genética , Locos de Características Quantitativas , RNA Neoplásico/análise , Transcriptoma , Alelos , Cromossomos Humanos Par 9 , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Degradação do RNAm Mediada por Códon sem Sentido , Polimorfismo de Nucleotídeo Único , Sequências Reguladoras de Ácido Nucleico , Análise de Sequência de RNARESUMO
With the advent of RNA-sequencing technology, we can detect different types of alternative splicing and determine how DNA variation regulates splicing. However, given the short read lengths used in most population-based RNA-sequencing experiments, quantifying transcripts accurately remains a challenge. Here we present a method, Altrans, for discovery of alternative splicing quantitative trait loci (asQTLs). To assess the performance of Altrans, we compared it to Cufflinks and MISO in simulations and Cufflinks for asQTL discovery. Simulations show that in the presence of unannotated transcripts, Altrans performs better in quantifications than Cufflinks and MISO. We have applied Altrans and Cufflinks to the Geuvadis dataset, which comprises samples from European and African populations, and discovered (FDR = 1%) 1,427 and 166 asQTLs with Altrans and 1,737 and 304 asQTLs with Cufflinks for Europeans and Africans, respectively. We show that, by discovering a set of asQTLs in a smaller subset of European samples and replicating these in the remaining larger subset of Europeans, both methods achieve similar replication levels (95% for both methods). We find many Altrans-specific asQTLs, which replicate to a high degree (93%). This is mainly due to junctions absent from the annotations and hence not tested with Cufflinks. The asQTLs are significantly enriched for biochemically active regions of the genome, functional marks, and variants in splicing regions, highlighting their biological relevance. We present an approach for discovering asQTLs that is a more direct assessment of splicing compared to other methods and is complementary to other transcript quantification methods.
Assuntos
Processamento Alternativo , População Negra/genética , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Locos de Características Quantitativas , Análise de Sequência de RNA/métodos , População Branca/genética , Algoritmos , Estudos de Coortes , Genoma Humano , Humanos , Isoformas de Proteínas , SoftwareRESUMO
Understanding how genetic variation affects distinct cellular phenotypes, such as gene expression levels, alternative splicing and DNA methylation levels, is essential for better understanding of complex diseases and traits. Furthermore, how inter-individual variation of DNA methylation is associated to gene expression is just starting to be studied. In this study, we use the GenCord cohort of 204 newborn Europeans' lymphoblastoid cell lines, T-cells and fibroblasts derived from umbilical cords. The samples were previously genotyped for 2.5 million SNPs, mRNA-sequenced, and assayed for methylation levels in 482,421 CpG sites. We observe that methylation sites associated to expression levels are enriched in enhancers, gene bodies and CpG island shores. We show that while the correlation between DNA methylation and gene expression can be positive or negative, it is very consistent across cell-types. However, this epigenetic association to gene expression appears more tissue-specific than the genetic effects on gene expression or DNA methylation (observed in both sharing estimations based on P-values and effect size correlations between cell-types). This predominance of genetic effects can also be reflected by the observation that allele specific expression differences between individuals dominate over tissue-specific effects. Additionally, we discover genetic effects on alternative splicing and interestingly, a large amount of DNA methylation correlating to alternative splicing, both in a tissue-specific manner. The locations of the SNPs and methylation sites involved in these associations highlight the participation of promoter proximal and distant regulatory regions on alternative splicing. Overall, our results provide high-resolution analyses showing how genome sequence variation has a broad effect on cellular phenotypes across cell-types, whereas epigenetic factors provide a secondary layer of variation that is more tissue-specific. Furthermore, the details of how this tissue-specificity may vary across inter-relations of molecular traits, and where these are occurring, can yield further insights into gene regulation and cellular biology as a whole.
Assuntos
Processamento Alternativo/genética , Metilação de DNA/genética , Epigênese Genética , Regulação da Expressão Gênica/genética , Variação Genética , Alelos , Ilhas de CpG , Humanos , Recém-Nascido , Especificidade de Órgãos , Polimorfismo de Nucleotídeo Único/genética , Regiões Promotoras Genéticas , Sequências Reguladoras de Ácido Nucleico/genéticaRESUMO
MOTIVATION: In order to discover quantitative trait loci, multi-dimensional genomic datasets combining DNA-seq and ChiP-/RNA-seq require methods that rapidly correlate tens of thousands of molecular phenotypes with millions of genetic variants while appropriately controlling for multiple testing. RESULTS: We have developed FastQTL, a method that implements a popular cis-QTL mapping strategy in a user- and cluster-friendly tool. FastQTL also proposes an efficient permutation procedure to control for multiple testing. The outcome of permutations is modeled using beta distributions trained from a few permutations and from which adjusted P-values can be estimated at any level of significance with little computational cost. The Geuvadis & GTEx pilot datasets can be now easily analyzed an order of magnitude faster than previous approaches. AVAILABILITY AND IMPLEMENTATION: Source code, binaries and comprehensive documentation of FastQTL are freely available to download at http://fastqtl.sourceforge.net/ CONTACT: emmanouil.dermitzakis@unige.ch or olivier.delaneau@unige.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Locos de Características Quantitativas , Genômica , Fenótipo , Software , Distribuições EstatísticasRESUMO
Elucidating the pathophysiology and molecular attributes of common disorders as well as developing targeted and effective treatments hinges on the study of the relevant cell type and tissues. Pancreatic beta cells within the islets of Langerhans are centrally involved in the pathogenesis of both type 1 and type 2 diabetes. Describing the differentiated state of the human beta cell has been hampered so far by technical (low resolution microarrays) and biological limitations (whole islet preparations rather than isolated beta cells). We circumvent these by deep RNA sequencing of purified beta cells from 11 individuals, presenting here the first characterization of the human beta cell transcriptome. We perform the first comparison of gene expression profiles between beta cells, whole islets, and beta cell depleted islet preparations, revealing thus beta-cell-specific expression and splicing signatures. Further, we demonstrate that genes with consistent increased expression in beta cells have neuronal-like properties, a signal previously hypothesized. Finally, we find evidence for extensive allelic imbalance in expression and uncover genetic regulatory variants (eQTLs) active in beta cells. This first molecular blueprint of the human beta cell offers biological insight into its differentiated function, including expression of key genes associated with both major types of diabetes.
Assuntos
Alelos , Células Secretoras de Insulina/metabolismo , Transcriptoma , Redes Reguladoras de Genes , Humanos , Especificidade de Órgãos , Locos de Características QuantitativasRESUMO
Height is a classic complex trait with common variants in a growing list of genes known to contribute to the phenotype. Using a genecentric genotyping array targeted toward cardiovascular-related loci, comprising 49,320 SNPs across approximately 2000 loci, we evaluated the association of common and uncommon SNPs with adult height in 114,223 individuals from 47 studies and six ethnicities. A total of 64 loci contained a SNP associated with height at array-wide significance (p < 2.4 × 10(-6)), with 42 loci surpassing the conventional genome-wide significance threshold (p < 5 × 10(-8)). Common variants with minor allele frequencies greater than 5% were observed to be associated with height in 37 previously reported loci. In individuals of European ancestry, uncommon SNPs in IL11 and SMAD3, which would not be genotyped with the use of standard genome-wide genotyping arrays, were strongly associated with height (p < 3 × 10(-11)). Conditional analysis within associated regions revealed five additional variants associated with height independent of lead SNPs within the locus, suggesting allelic heterogeneity. Although underpowered to replicate findings from individuals of European ancestry, the direction of effect of associated variants was largely consistent in African American, South Asian, and Hispanic populations. Overall, we show that dense coverage of genes for uncommon SNPs, coupled with large-scale meta-analysis, can successfully identify additional variants associated with a common complex trait.
Assuntos
Estatura/genética , Sistema Cardiovascular , Heterogeneidade Genética , Loci Gênicos , Polimorfismo de Nucleotídeo Único , Adulto , Negro ou Afro-Americano/genética , Povo Asiático/genética , Feminino , Frequência do Gene , Estudo de Associação Genômica Ampla , Hispânico ou Latino/genética , Humanos , Interleucina-11/genética , Masculino , Proteína Smad3/genética , População Branca/genéticaRESUMO
Transposable elements (TEs) are prevalent repeats in the human genome, play a significant role in the regulome, and their disruption can contribute to tumorigenesis. However, TE influence on gene expression in cancer remains unclear. Here, we analyze 275 normal colon and 276 colorectal cancer samples from the SYSCOL cohort, discovering 10,231 and 5,199 TE-expression quantitative trait loci (eQTLs) in normal and tumor tissues, respectively, of which 376 are colorectal cancer specific eQTLs, likely due to methylation changes. Tumor-specific TE-eQTLs show greater enrichment of transcription factors, compared to shared TE-eQTLs suggesting specific regulation of their expression in tumor. Bayesian networks reveal 1,766 TEs as mediators of genetic effects, altering the expression of 1,558 genes, including 55 known cancer driver genes and show that tumor-specific TE-eQTLs trigger the driver capability of TEs. These insights expand our knowledge of cancer drivers, deepening our understanding of tumorigenesis and presenting potential avenues for therapeutic interventions.
Assuntos
Neoplasias Colorretais , Elementos de DNA Transponíveis , Humanos , Elementos de DNA Transponíveis/genética , Teorema de Bayes , Fatores de Transcrição/metabolismo , Carcinogênese/genética , Neoplasias Colorretais/genéticaRESUMO
Gene silencing by DNA hypermethylation of CpG islands is a well-characterized phenomenon in cancer. The effect of hypomethylation in particular of non-CpG island genes is much less well described. By genome-wide screening, we identified 105 genes in microsatellite stable (MSS) colorectal adenocarcinomas with an inverse correlation (Spearman's ρ ≤ -0.40) between methylation and expression. Of these, 35 (33%) were hypomethylated non-CpG island genes and two of them, APOLD1 (Spearman's ρ = -0.82) and SRPX2 (Spearman's ρ = -0.80) were selected for further analyses. Hypomethylation of both genes were localized events not shared by adjacent genes. A set of 662 FFPE DNA samples not only confirmed that APOLD1 and SRPX2 are hypomethylated in CRC but also revealed hypomethylation to be significantly (p < 0.01) associated with tumors being localized in the left side, CpG island methylator phenotype negative, MSS, BRAF wt, undifferentiated and of adenocarcinoma histosubtype. Demethylation experiments supported SRPX2 being epigenetically regulated via DNA methylation, whereas other mechanisms in addition to DNA methylation seem to be involved in the regulation of APOLD1. We further identified miR-149 as a potential novel post-transcriptional regulator of SRPX2. In carcinoma tissue, miR-149 was downregulated and inversely correlated to SRPX2 (ρ = -0.77). Furthermore, ectopic expression of miR-149 significantly reduced SRPX2 transcript levels. Our study highlights that in colorectal tumors, hypomethylation of non-CpG island-associated promoters deregulate gene expression nearly as frequent as do CpG-island hypermethylation. The hypomethylation of SRPX2 is focal and not part of a large block. Furthermore, it often translates to an increased expression level, which may be modulated by miR-149.
Assuntos
Adenocarcinoma/genética , Neoplasias Colorretais/genética , Metilação de DNA , MicroRNAs/metabolismo , Proteínas do Tecido Nervoso/metabolismo , Regiões Promotoras Genéticas , Adenoma/genética , Apolipoproteínas/metabolismo , Ilhas de CpG , Regulação Neoplásica da Expressão Gênica , Humanos , Proteínas de Membrana , Instabilidade de Microssatélites , Proteínas de Neoplasias , Regiões Promotoras Genéticas/genética , Proteínas Proto-Oncogênicas B-raf/metabolismo , Reação em Cadeia da Polimerase em Tempo Real , Transcrição Gênica , TranscriptomaRESUMO
Genome-wide association studies have identified a region on chromosome 9p that is associated with coronary artery disease (CAD). The region is also associated with type 2 diabetes (T2D), a risk factor for CAD, although different SNPs were reported to be associated to each disease in separate studies. We have undertaken a case-control study in 4251 CAD cases and 4443 controls in four European populations using previously reported ('literature') and tagging SNPs. We replicated the literature SNPs (P = 8x10(-13); OR = 1.29; 95% CI: 1.20-1.38) and showed that the strong consistent association detected by these SNPs is a consequence of a 'yin-yang' haplotype pattern spanning 53 kb. There was no evidence of additional CAD susceptibility alleles over the major risk haplotype. CAD patients without myocardial infarction (MI) showed a trend towards stronger association than MI patients. The CAD susceptibility conferred by this locus did not differ by sex, age, smoking, obesity, hypertension or diabetes. A simultaneous test of CAD and diabetes susceptibility with CAD and T2D-associated SNPs indicated that these associations were independent of each other. Moreover, this region was not associated with differences in plasma levels of low-density lipoprotein cholesterol, high-density lipoprotein cholesterol, fibrinogen, albumin, uric acid, bilirubin or homocysteine, although the CAD-high-risk allele was paradoxically associated with lower triglyceride levels. A large antisense non-coding RNA gene (ANRIL) collocates with the high-risk haplotype, is expressed in tissues and cell types that are affected by atherosclerosis and is a prime candidate gene for the chromosome 9p CAD locus.
Assuntos
Cromossomos Humanos Par 9 , Doença da Artéria Coronariana/genética , Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , RNA não Traduzido/genética , Sequência de Bases , Primers do DNA , Haplótipos , Humanos , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Fatores de RiscoRESUMO
Transcriptional characterization and classification has potential to resolve the inter-tumor heterogeneity of colorectal cancer and improve patient management. Yet, robust transcriptional profiling is difficult using formalin-fixed, paraffin-embedded (FFPE) samples, which complicates testing in clinical and archival material. We present MethCORR, an approach that allows uniform molecular characterization and classification of fresh-frozen and FFPE samples. MethCORR identifies genome-wide correlations between RNA expression and DNA methylation in fresh-frozen samples. This information is used to infer gene expression information in FFPE samples from their methylation profiles. MethCORR is here applied to methylation profiles from 877 fresh-frozen/FFPE samples and comparative analysis identifies the same two subtypes in four independent cohorts. Furthermore, subtype-specific prognostic biomarkers that better predicts relapse-free survival (HR = 2.66, 95%CI [1.67-4.22], P value < 0.001 (log-rank test)) than UICC tumor, node, metastasis (TNM) staging and microsatellite instability status are identified and validated using DNA methylation-specific PCR. The MethCORR approach is general, and may be similarly successful for other cancer types.
Assuntos
Biomarcadores Tumorais/genética , Neoplasias Colorretais/mortalidade , Epigenoma/genética , Modelos Genéticos , Recidiva Local de Neoplasia/diagnóstico , Idoso , Colo/patologia , Neoplasias Colorretais/genética , Neoplasias Colorretais/patologia , Metilação de DNA , Conjuntos de Dados como Assunto , Intervalo Livre de Doença , Feminino , Formaldeído , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica , Humanos , Mucosa Intestinal/patologia , Masculino , Pessoa de Meia-Idade , Recidiva Local de Neoplasia/genética , Inclusão em Parafina , Prognóstico , Reto/patologia , Medição de Risco/métodos , Fixação de TecidosRESUMO
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMO
We apply integrative approaches to expression quantitative loci (eQTLs) from 44 tissues from the Genotype-Tissue Expression project and genome-wide association study data. About 60% of known trait-associated loci are in linkage disequilibrium with a cis-eQTL, over half of which were not found in previous large-scale whole blood studies. Applying polygenic analyses to metabolic, cardiovascular, anthropometric, autoimmune, and neurodegenerative traits, we find that eQTLs are significantly enriched for trait associations in relevant pathogenic tissues and explain a substantial proportion of the heritability (40-80%). For most traits, tissue-shared eQTLs underlie a greater proportion of trait associations, although tissue-specific eQTLs have a greater contribution to some traits, such as blood pressure. By integrating information from biological pathways with eQTL target genes and applying a gene-based approach, we validate previously implicated causal genes and pathways, and propose new variant and gene associations for several complex traits, which we replicate in the UK BioBank and BioVU.
Assuntos
Doença/genética , Regulação da Expressão Gênica , Expressão Gênica , Perfilação da Expressão Gênica/métodos , Estudo de Associação Genômica Ampla/métodos , Genótipo , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Característica Quantitativa HerdávelRESUMO
Microsatellite instability (MSI) leads to accumulation of an excessive number of mutations in the genome, mostly small insertions and deletions. MSI colorectal cancers (CRCs), however, also contain more point mutations than microsatellite-stable (MSS) tumors, yet they have not been as comprehensively studied. To identify candidate driver genes affected by point mutations in MSI CRC, we ranked genes based on mutation significance while correcting for replication timing and gene expression utilizing an algorithm, MutSigCV Somatic point mutation data from the exome kit-targeted area from 24 exome-sequenced sporadic MSI CRCs and respective normals, and 12 whole-genome-sequenced sporadic MSI CRCs and respective normals were utilized. The top 73 genes were validated in 93 additional MSI CRCs. The MutSigCV ranking identified several well-established MSI CRC driver genes and provided additional evidence for previously proposed CRC candidate genes as well as shortlisted genes that have to our knowledge not been linked to CRC before. Two genes, SMARCB1 and STK38L, were also functionally scrutinized, providing evidence of a tumorigenic role, for SMARCB1 mutations in particular.
Assuntos
Neoplasias Colorretais/genética , Neoplasias Colorretais/patologia , Instabilidade de Microssatélites , Mutação Puntual , Redes Reguladoras de Genes , Humanos , Anotação de Sequência Molecular , Análise de Sequência de DNARESUMO
Point mutations in cancer have been extensively studied but chromosomal gains and losses have been more challenging to interpret due to their unspecific nature. Here we examine high-resolution allelic imbalance (AI) landscape in 1699 colorectal cancers, 256 of which have been whole-genome sequenced (WGSed). The imbalances pinpoint 38 genes as plausible AI targets based on previous knowledge. Unbiased CRISPR-Cas9 knockout and activation screens identified in total 79 genes within AI peaks regulating cell growth. Genetic and functional data implicate loss of TP53 as a sufficient driver of AI. The WGS highlights an influence of copy number aberrations on the rate of detected somatic point mutations. Importantly, the data reveal several associations between AI target genes, suggesting a role for a network of lineage-determining transcription factors in colorectal tumorigenesis. Overall, the results unravel the contribution of AI in colorectal cancer and provide a plausible explanation why so few genes are commonly affected by point mutations in cancers.