RESUMO
The application of causal mediation analysis (CMA) considering the mediation effect of a third variable is increasing in epidemiological studies; however, this requires fitting strong assumptions on confounding bias. To address this limitation, we propose an extension of CMA combining it with Mendelian randomization (MRinCMA). We applied the new approach to analyse the causal effect of obesity and diabetes on pancreatic cancer, considering each factor as potential mediator. To check the performance of MRinCMA under several conditions/scenarios, we used it in different simulated data sets and compared it with structural equation models. For continuous variables, MRinCMA and structural equation models performed similarly, suggesting that both approaches are valid to obtain unbiased estimates. When noncontinuous variables were considered, MRinCMA presented, overall, lower bias than structural equation models. By applying MRinCMA, we did not find any evidence of causality of obesity or diabetes on pancreatic cancer. With this new methodology, researchers would be able to address CMA hypotheses by appropriately accounting for the confounding bias assumption regardless of the conditions used in their studies in different settings.
Assuntos
Diabetes Mellitus , Análise de Mediação , Humanos , Modelos Genéticos , Análise da Randomização Mendeliana/métodos , ObesidadeRESUMO
Pancreatic ductal adenocarcinoma (PDAC) is among the most lethal cancers. Its poor prognosis is predominantly due to the fact that most patients remain asymptomatic until the disease reaches an advanced stage, alongside the lack of early markers and screening strategies. A better understanding of PDAC risk factors is essential for the identification of groups at high risk in the population. Genome-wide association studies (GWAS) have been a powerful tool for detecting genetic variants associated with complex traits, including pancreatic cancer. By exploiting functional and GWAS data, we investigated the associations between polymorphisms affecting gene function in the pancreas (expression quantitative trait loci, eQTLs) and PDAC risk. In a two-phase approach, we analysed 13 713 PDAC cases and 43 784 controls and identified a genome-wide significant association between the A allele of the rs2035875 polymorphism and increased PDAC risk (P = 7.14 × 10-10). This allele is known to be associated with increased expression in the pancreas of the keratin genes KRT8 and KRT18, whose increased levels have been reported to correlate with various tumour cell characteristics. Additionally, the A allele of the rs789744 variant was associated with decreased risk of developing PDAC (P = 3.56 × 10-6). This single nucleotide polymorphism is situated in the SRGAP1 gene and the A allele is associated with higher expression of the gene, which in turn inactivates the cyclin-dependent protein 42 (CDC42) gene expression, thus decreasing the risk of PDAC. In conclusion, we present here a functional-based novel PDAC risk locus and an additional strong candidate supported by significant associations and plausible biological mechanisms.
Assuntos
Carcinoma Ductal Pancreático/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Neoplasias Pancreáticas/genética , Locos de Características Quantitativas , Idoso , Alelos , Estudos de Casos e Controles , Feminino , Proteínas Ativadoras de GTPase/genética , Humanos , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo ÚnicoRESUMO
The advent of metagenomics in animal breeding poses the challenge of statistically modelling the relationship between the microbiome, the host genetics and relevant complex traits. A set of structural equation models (SEMs) of a recursive type within a Markov chain Monte Carlo (MCMC) framework was proposed here to jointly analyse the host-metagenome-phenotype relationship. A non-recursive bivariate model was set as benchmark to compare the recursive model. The relative abundance of rumen microbes (RA), methane concentration (CH4 ) and the host genetics was used as a case of study. Data were from 337 Holstein cows from 12 herds in the north and north-west of Spain. Microbial composition from each cow was obtained from whole metagenome sequencing of ruminal content samples using a MinION device from Oxford Nanopore Technologies. Methane concentration was measured with Guardian® NG infrared gas monitor from Edinburgh Sensors during cow's visits to the milking automated system. A quarterly average from the methane eructation peaks for each cow was computed and used as phenotype for CH4 . Heritability of CH4 was estimated at 0.12 ± 0.01 in both the recursive and bivariate models. Likewise, heritability estimates for the relative abundance of the taxa overlapped between models and ranged between 0.08 and 0.48. Genetic correlations between the microbial composition and CH4 ranged from -0.76 to 0.65 in the non-recursive bivariate model and from -0.68 to 0.69 in the recursive model. Regardless of the statistical model used, positive genetic correlations with methane were estimated consistently for the seven genera pertaining to the Ciliophora phylum, as well as for those genera belonging to the Euryarchaeota (Methanobrevibacter sp.), Chytridiomycota (Neocallimastix sp.) and Fibrobacteres (Fibrobacter sp.) phyla. These results suggest that rumen's whole metagenome recursively regulates methane emissions in dairy cows and that both CH4 and the microbiota compositions are partially controlled by the host genotype.
Assuntos
Bovinos/metabolismo , Bovinos/microbiologia , Indústria de Laticínios , Metano/biossíntese , Microbiota , Modelos Estatísticos , Animais , Cadeias de Markov , Método de Monte CarloRESUMO
Primary and secondary prevention can highly benefit a personalized medicine approach through the accurate discrimination of individuals at high risk of developing a specific disease from those at moderate and low risk. To this end precise risk prediction models need to be built. This endeavor requires a precise characterization of the individual exposome, genome, and phenome. Massive molecular omics data representing the different layers of the biological processes of the host and the nonhost will enable to build more accurate risk prediction models. Epidemiologists aim to integrate omics data along with important information coming from other sources (questionnaires, candidate markers) that has been proved to be relevant in the discrimination risk assessment of complex diseases. However, the integrative models in large-scale epidemiologic research are still in their infancy and they face numerous challenges, some of them at the analytical stage. So far, there are a small number of studies that have integrated more than two omics data sets, and the inclusion of non-omics data in the same models is still missing in most of studies. In this contribution, we aim at approaching the omics and non-omics data integration from the epidemiology scope by considering the "massive" inclusion of variables in the risk assessment and predictive models. We also provide already available examples of integrative contributions in the field, propose analytical strategies that allow considering both omics and non-omics data in the models, and finally review the challenges imbedding this type of research.
Assuntos
Modelos Genéticos , Estudos Epidemiológicos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genômica , Humanos , Polimorfismo de Nucleotídeo ÚnicoRESUMO
The continuous advancement in genotyping technology has not been accompanied by the application of innovative statistical methods, such as multi-marker methods (MMM), to unravel genetic associations with complex traits. Although the performance of MMM has been widely explored in a prediction context, little is known on their behavior in the quantitative trait loci (QTL) detection under complex genetic architectures. We shed light on this still open question by applying Bayes A (BA) and Bayesian LASSO (BL) to simulated and real data. Both methods were compared to the single marker regression (SMR). Simulated data were generated in the context of six scenarios differing on effect size, minor allele frequency (MAF) and linkage disequilibrium (LD) between QTLs. These were based on real SNP genotypes in chromosome 21 from the Spanish Bladder Cancer Study. We show how the genetic architecture dramatically affects the behavior of the methods in terms of power, type I error and accuracy of estimates. Markers with high MAF are easier to detect by all methods, especially if they have a large effect on the phenotypic trait. A high LD between QTLs with either large or small effects differently affects the power of the methods: it impairs QTL detection with BA, irrespectively of the effect size, although boosts that of small effects with BL and SMR. We demonstrate the convenience of applying MMM rather than SMR because of their larger power and smaller type I error. Results from real data when applying MMM suggest novel associations not detected by SMR.
Assuntos
Simulação por Computador , Estudo de Associação Genômica Ampla/estatística & dados numéricos , Técnicas de Genotipagem/métodos , Alelos , Teorema de Bayes , Estudos de Casos e Controles , Frequência do Gene , Genes Neoplásicos , Técnicas de Genotipagem/estatística & dados numéricos , Hispânico ou Latino/genética , Hispânico ou Latino/estatística & dados numéricos , Humanos , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Neoplasias da Bexiga Urinária/epidemiologia , Neoplasias da Bexiga Urinária/genéticaRESUMO
Objective: Genome wide association studies have identified an exon 6 CTRB2 deletion variant that associates with increased risk of pancreatic cancer. To acquire evidence on its causal role, we developed a new mouse strain carrying an equivalent variant in Ctrb1 , the mouse orthologue of CTRB2 . Design: We used CRISPR/Cas9 to introduce a 707bp deletion in Ctrb1 encompassing exon 6 ( Ctrb1 Δexon6 ). This mutation closely mimics the human deletion variant. Mice carrying the mutant allele were extensively profiled at 3 months to assess their phenotype. Results: Ctrb1 Δexon6 mutant mice express a truncated CTRB1 that accumulates in the ER. The pancreas of homozygous mutant mice displays reduced chymotrypsin activity and total protein synthesis. The histological aspect of the pancreas is inconspicuous but ultrastructural analysis shows evidence of dramatic ER stress and cytoplasmic and nuclear inclusions. Transcriptomic analyses of the pancreas of mutant mice reveals acinar program down-regulation and increased activity of ER stress-related and inflammatory pathways. Heterozygous mice have an intermediate phenotype. Agr2 is one of the most up-regulated genes in mutant pancreata. Ctrb1 Δexon6 mice exhibit impaired recovery from acute caerulein-induced pancreatitis. Administration of TUDCA or sulindac partially alleviates the phenotype. A transcriptomic signature derived from the mutant pancreata is significantly enriched in normal human pancreas of CTRB2 exon 6 deletion variant carriers from the GTEx cohort. Conclusions: This mouse strain provides formal evidence that the Ctrb1 Δexon6 variant causes ER stress and inflammation in vivo , providing an excellent model to understand its contribution to pancreatic ductal adenocarcinoma development and to identify preventive strategies. SUMMARY BOX: What is already known about this subject?: - CTRB2 is one of the most abundant proteins produced by human pancreatic acinar cells. - A common exon 6 deletion variant in CTRB2 has been associated with an increased risk of pancreatic ductal adenocarcinoma. - Misfolding of digestive enzymes is associated with pancreatic pathology.What are the new findings?: - We developed a novel genetic model that recapitulates the human CTRB2 deletion variant in the mouse orthologue, Ctrb1 . - Truncated CTRB1 misfolds and accumulates in the ER; yet, mutant mice display a histologically normal pancreas at 3 months age.- CTRB1 and associated chaperones colocalize in the ER, the cytoplasm, and the nucleus of acinar cells.- Transcriptomics analysis reveals reduced activity of the acinar program and increased activity of pathways involved in ER stress, unfolded protein response, and inflammation.- Mutant mice are sensitized to pancreatic damage and do not recover properly from a mild caerulein-induced pancreatitis.- TUDCA administration partially relieves the ER stress in mutant mice.How might it impact on clinical practice in the foreseeable future?: - The new mouse model provides a tool to identify the mechanisms leading to increased pancreatic cancer risk in CTRB2 exon 6 carriers. - The findings suggest that drugs that cause ER stress relief and/or reduce inflammation might provide preventive opportunities.
RESUMO
BACKGROUND: Efficient presentation of mutant peptide fragments by the human leukocyte antigen class I (HLA-I) genes is necessary for immune-mediated killing of cancer cells. According to recent reports, patient HLA-I genotypes can impact the efficacy of cancer immunotherapy, and the somatic loss of HLA-I heterozygosity has been established as a factor in immune evasion. While global deregulated expression of HLA-I has also been reported in different tumor types, the role of HLA-I allele-specific expression loss - that is, the preferential RNA expression loss of specific HLA-I alleles - has not been fully characterized in cancer. METHODS: Here, we use RNA and whole-exome sequencing data to quantify HLA-I allele-specific expression (ASE) in cancer using our novel method arcasHLA-quant. RESULTS: We show that HLA-I ASE loss in at least one of the three HLA-I genes is a pervasive phenomenon across TCGA tumor types. In pancreatic adenocarcinoma, tumor-specific HLA-I ASE loss is associated with decreased overall survival specifically in the basal-like subtype, a finding that we validated in an independent cohort through laser-capture microdissection. Additionally, we show that HLA-I ASE loss is associated with poor immunotherapy outcomes in metastatic melanoma through retrospective analyses. CONCLUSIONS: Together, our results highlight the prevalence of HLA-I ASE loss and provide initial evidence of its clinical significance in cancer prognosis and immunotherapy treatment.
Assuntos
Adenocarcinoma , Neoplasias Pancreáticas , Humanos , Alelos , Adenocarcinoma/genética , Estudos Retrospectivos , Neoplasias Pancreáticas/genética , Antígenos de Histocompatibilidade Classe I/genética , RNARESUMO
Interest on methane emissions from livestock has increased in later years as it is an anthropogenic greenhouse gas with an important warming potential. The rumen microbiota has a large influence on the production of enteric methane. Animals harbour a second genome consisting of microbes, collectively referred to as the "microbiome". The rumen microbial community plays an important role in feed digestion, feed efficiency, methane emission and health status. This review recaps the current knowledge on the genetic control that the cow exerts on the rumen microbiota composition. Heritability estimates for the rumen microbiota composition range between 0.05 and 0.40 in the literature, depending on the taxonomical group or microbial gene function. Variables depicting microbial diversity or aggregating microbial information are also heritable within the same range. This study includes a genome-wide association analysis on the microbiota composition, considering the relative abundance of some microbial taxa previously associated to enteric methane in dairy cattle (Archaea, Dialister, Entodinium, Eukaryota, Lentisphaerae, Methanobrevibacter, Neocallimastix, Prevotella and Stentor). Host genomic regions associated with the relative abundance of these microbial taxa were identified after Benjamini-Hoschberg correction (Padj < 0.05). An in-silico functional analysis using FUMA and DAVID online tools revealed that these gene sets were enriched in tissues like brain cortex, brain amigdala, pituitary, salivary glands and other parts of the digestive system, and are related to appetite, satiety and digestion. These results allow us to have greater knowledge about the composition and function of the rumen microbiome in cattle. The state-of-the art strategies to include methane traits in the selection indices in dairy cattle populations is reviewed. Several strategies to include methane traits in the selection indices have been studied worldwide, using bioeconomical models or economic functions under theoretical frameworks. However, their incorporation in the breeding programmes is still scarce. Some potential strategies to include methane traits in the selection indices of dairy cattle population are presented. Future selection indices will need to increase the weight of traits related to methane emissions and sustainability. This review will serve as a compendium of the current state of the art in genetic strategies to reduce methane emissions in dairy cattle.
Assuntos
Metano , Microbiota , Feminino , Bovinos , Animais , Metano/metabolismo , Estudo de Associação Genômica Ampla/veterinária , Bactérias/genética , Archaea/genética , Rúmen/metabolismoRESUMO
BACKGROUND: Genomic regions identified by genome-wide association studies (GWAS) for bladder cancer risk provide new insights into etiology. OBJECTIVE: To identify new susceptibility variants for bladder cancer in a meta-analysis of new and existing genome-wide genotype data. DESIGN, SETTING, AND PARTICIPANTS: Data from 32 studies that includes 13,790 bladder cancer cases and 343,502 controls of European ancestry were used for meta-analysis. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSES: Log-additive associations of genetic variants were assessed using logistic regression models. A fixed-effects model was used for meta-analysis of the results. Stratified analyses were conducted to evaluate effect modification by sex and smoking status. A polygenic risk score (PRS) was generated on the basis of known and novel susceptibility variants and tested for interaction with smoking. RESULTS AND LIMITATIONS: Multiple novel bladder cancer susceptibility loci (6p.22.3, 7q36.3, 8q21.13, 9p21.3, 10q22.1, 19q13.33) as well as improved signals in three known regions (4p16.3, 5p15.33, 11p15.5) were identified, bringing the number of independent markers at genome-wide significance (p < 5 × 10-8) to 24. The 4p16.3 (FGFR3/TACC3) locus was associated with a stronger risk for women than for men (p-interaction = 0.002). Bladder cancer risk was increased by interactions between smoking status and genetic variants at 8p22 (NAT2; multiplicative p value for interaction [pM-I] = 0.004), 8q21.13 (PAG1; pM-I = 0.01), and 9p21.3 (LOC107987026/MTAP/CDKN2A; pM-I = 0.02). The PRS based on the 24 independent GWAS markers (odds ratio per standard deviation increase 1.49, 95% confidence interval 1.44-1.53), which also showed comparable results in two prospective cohorts (UK Biobank, PLCO trial), revealed an approximately fourfold difference in the lifetime risk of bladder cancer according to the PRS (e.g., 1st vs 10th decile) for both smokers and nonsmokers. CONCLUSIONS: We report novel loci associated with risk of bladder cancer that provide clues to its biological underpinnings. Using 24 independent markers, we constructed a PRS to stratify lifetime risk. The PRS combined with smoking history, and other established risk factors, has the potential to inform future screening efforts for bladder cancer. PATIENT SUMMARY: We identified new genetic markers that provide biological insights into the genetic causes of bladder cancer. These genetic risk factors combined with lifestyle risk factors, such as smoking, may inform future preventive and screening strategies for bladder cancer.
Assuntos
Arilamina N-Acetiltransferase , Neoplasias da Bexiga Urinária , Masculino , Humanos , Feminino , Estudo de Associação Genômica Ampla , Estudos Prospectivos , Fatores de Risco , Genótipo , Neoplasias da Bexiga Urinária/genética , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Proteínas Associadas aos Microtúbulos , Proteínas de Membrana , Proteínas Adaptadoras de Transdução de SinalRESUMO
Background: Infiltrating B and T cells have been observed in several tumor tissues, including pancreatic ductal adenocarcinoma (PDAC). The majority known PDAC risk factors point to a chronic inflammatory process leading to different forms of immunological infiltration. Understanding pancreatic tumor infiltration may lead to improved knowledge of this devastating disease. Methods: We extracted the immunoglobulins (IGs) and T cell receptors (TCRs) from RNA-sequencing of 144 PDAC from TCGA and 180 pancreatic normal tissue from GTEx. We used Shannon entropy to find differences in IG/TCR diversity. We performed a clonotype analysis considering the IG clone definition (same V and J segments, same CDR3 length, and 90% nucleotide identity between CDR3s) to study differences among the tumor samples. Finally, we performed an association analysis to find host and tumor factors associated with the IG/TCR. Results: PDAC presented a richer and more diverse IG and TCR infiltration than normal pancreatic tissue. A higher IG infiltration was present in heavy smokers and females and it was associated with better overall survival. In addition, specific IG clonotypes classified samples with better prognosis explaining 24% of the prognosis phenotypic variance. On the other hand, a larger TCR infiltration was present in patients with previous history of diabetes and was associated with lower nonantigen load. Conclusions: Our findings support PDAC subtyping according to its immune repertoire landscape with a potential impact on the understanding of the inflammatory basis of PDAC risk factors as well as the design of treatment options and prognosis monitoring.
Assuntos
Linfócitos B/imunologia , Carcinoma Ductal Pancreático/imunologia , Linfócitos do Interstício Tumoral/imunologia , Neoplasias Pancreáticas/imunologia , Linfócitos T/imunologia , Microambiente Tumoral/imunologia , Adulto , Idoso , Linfócitos B/metabolismo , Carcinoma Ductal Pancreático/genética , Carcinoma Ductal Pancreático/mortalidade , Carcinoma Ductal Pancreático/terapia , Bases de Dados Genéticas , Feminino , Perfilação da Expressão Gênica , Humanos , Imunoglobulinas/genética , Linfócitos do Interstício Tumoral/metabolismo , Masculino , Pessoa de Meia-Idade , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/mortalidade , Neoplasias Pancreáticas/terapia , Fenótipo , Prognóstico , RNA-Seq , Receptores de Antígenos de Linfócitos T/genética , Fatores de Risco , Fatores Sexuais , Fumar/efeitos adversos , Linfócitos T/metabolismo , TranscriptomaRESUMO
Genetic factors play an important role in the susceptibility to pancreatic cancer (PC). However, established loci explain a small proportion of genetic heritability for PC; therefore, more progress is needed to find the missing ones. We aimed at identifying single nucleotide polymorphisms (SNPs) affecting PC risk through effects on micro-RNA (miRNA) function. We searched in silico the genome for SNPs in miRNA seed sequences or 3 prime untranslated regions (3'UTRs) of miRNA target genes. Genome-wide association data of PC cases and controls from the Pancreatic Cancer Cohort (PanScan) Consortium and the Pancreatic Cancer Case-Control (PanC4) Consortium were re-analyzed for discovery, and genotyping data from two additional consortia (PanGenEU and PANDoRA) were used for replication, for a total of 14,062 cases and 11,261 controls. None of the SNPs reached genome-wide significance in the meta-analysis, but for three of them the associations were in the same direction in all the study populations and showed lower value of p in the meta-analyses than in the discovery phase. Specifically, rs7985480 was consistently associated with PC risk (OR = 1.12, 95% CI 1.07-1.17, p = 3.03 × 10-6 in the meta-analysis). This SNP is in linkage disequilibrium (LD) with rs2274048, which modulates binding of various miRNAs to the 3'UTR of UCHL3, a gene involved in PC progression. In conclusion, our results expand the knowledge of the genetic PC risk through miRNA-related SNPs and show the usefulness of functional prioritization to identify genetic polymorphisms associated with PC risk.
RESUMO
BACKGROUND: Pancreatic cancer (PC) is a complex disease in which both non-genetic and genetic factors interplay. To date, 40 GWAS hits have been associated with PC risk in individuals of European descent, explaining 4.1% of the phenotypic variance. METHODS: We complemented a new conventional PC GWAS (1D) with genome spatial autocorrelation analysis (2D) permitting to prioritize low frequency variants not detected by GWAS. These were further expanded via Hi-C map (3D) interactions to gain additional insight into the inherited basis of PC. In silico functional analysis of public genomic information allowed prioritization of potentially relevant candidate variants. RESULTS: We identified several new variants located in genes for which there is experimental evidence of their implication in the biology and function of pancreatic acinar cells. Among them is a novel independent variant in NR5A2 (rs3790840) with a meta-analysis p value = 5.91E-06 in 1D approach and a Local Moran's Index (LMI) = 7.76 in 2D approach. We also identified a multi-hit region in CASC8-a lncRNA associated with pancreatic carcinogenesis-with a lowest p value = 6.91E-05. Importantly, two new PC loci were identified both by 2D and 3D approaches: SIAH3 (LMI = 18.24), CTRB2/BCAR1 (LMI = 6.03), in addition to a chromatin interacting region in XBP1-a major regulator of the ER stress and unfolded protein responses in acinar cells-identified by 3D; all of them with a strong in silico functional support. CONCLUSIONS: This multi-step strategy, combined with an in-depth in silico functional analysis, offers a comprehensive approach to advance the study of PC genetic susceptibility and could be applied to other diseases.
Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Neoplasias Pancreáticas/genética , Biomarcadores Tumorais/genética , Linhagem Celular Tumoral , Simulação por Computador , Redes Reguladoras de Genes , Genoma Humano , Humanos , Desequilíbrio de Ligação/genética , Reprodutibilidade dos Testes , Transdução de Sinais/genéticaRESUMO
CONTEXT: The identification of markers able to determine medullary thyroid cancer (MTC) patients at high-risk of disease progression is critical to improve their clinical management and outcome. Previous studies have suggested that expression of the stem cell marker CD133 is associated with MTC aggressiveness. OBJECTIVE: To evaluate CD133 impact on disease progression in MTC and explore the regulatory mechanisms leading to the upregulation of this protein in aggressive tumors. PATIENTS: We compiled a series of 74 MTCs with associated clinical data and characterized them for mutations in RET and RAS proto-oncogenes, presumed to be related with disease clinical behavior. RESULTS: We found that CD133 immunohistochemical expression was associated with adverse clinicopathological features and predicted a reduction in time to disease progression even when only RET-mutated cases were considered in the analysis (log-rank test Pâ <â 0.003). Univariate analysis for progression-free survival revealed CD133 expression and presence of tumor emboli in peritumoral blood vessels as the most significant prognostic covariates among others such as age, gender, and prognostic stage. Multivariate analysis identified both variables as independent factors of poor prognosis (hazard ratioâ =â 16.6 and 2; Pâ =â 0.001 and 0.010, respectively). Finally, we defined hsa-miR-30a-5p, a miRNA downregulated in aggressive MTCs, as a CD133 expression regulator. Ectopic expression of hsa-miR-30a-5p in MZ-CRC-1 (RETM918T) cells significantly reduced CD133 mRNA expression. CONCLUSIONS: Our results suggest that CD133 expression may be a useful tool to identify MTC patients with poor prognosis, who may benefit from a more extensive primary surgical management and follow-up.
Assuntos
Antígeno AC133/metabolismo , Carcinoma Medular/metabolismo , Glândula Tireoide/metabolismo , Neoplasias da Glândula Tireoide/metabolismo , Antígeno AC133/genética , Adulto , Idoso , Biomarcadores Tumorais/metabolismo , Carcinoma Medular/genética , Carcinoma Medular/patologia , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Mutação , Prognóstico , Intervalo Livre de Progressão , Proteínas Proto-Oncogênicas c-ret/genética , Glândula Tireoide/patologia , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/patologia , Proteínas ras/genéticaRESUMO
Omics data integration is already a reality. However, few omics-based algorithms show enough predictive ability to be implemented into clinics or public health domains. Clinical/epidemiological data tend to explain most of the variation of health-related traits, and its joint modeling with omics data is crucial to increase the algorithm's predictive ability. Only a small number of published studies performed a "real" integration of omics and non-omics (OnO) data, mainly to predict cancer outcomes. Challenges in OnO data integration regard the nature and heterogeneity of non-omics data, the possibility of integrating large-scale non-omics data with high-throughput omics data, the relationship between OnO data (i.e., ascertainment bias), the presence of interactions, the fairness of the models, and the presence of subphenotypes. These challenges demand the development and application of new analysis strategies to integrate OnO data. In this contribution we discuss different attempts of OnO data integration in clinical and epidemiological studies. Most of the reviewed papers considered only one type of omics data set, mainly RNA expression data. All selected papers incorporated non-omics data in a low-dimensionality fashion. The integrative strategies used in the identified papers adopted three modeling methods: Independent, conditional, and joint modeling. This review presents, discusses, and proposes integrative analytical strategies towards OnO data integration.
Assuntos
Biologia Computacional/métodos , Locos de Características Quantitativas , Algoritmos , Predisposição Genética para Doença , Genômica , Humanos , Modelos Genéticos , Prognóstico , Análise de Sequência de RNARESUMO
The emergence of high-throughput data in biology has increased the need for functional in silico analysis and prompted the development of integrative bioinformatics tools to facilitate the obtainment of biologically meaningful data. In this paper, we present DoriTool, a comprehensive, easy, and friendly pipeline integrating biological data from different functional tools. The tool was designed with the aim to maximize reproducibility and reduce the working time of the researchers, especially of those with limited bioinformatics skills, and to help them with the interpretation of the results. DoriTool is based upon an integrative strategy implemented following a modular design pattern. Using scripts written in Bash, Perl and R, it performs a functional in silico analysis annotation at mutation/variant level, gene level, pathway level and network level by combining up-to-date functional and genomic data and integrating also third-party bioinformatics tools in a pipeline. DoriTool uses GRCh37 human assembly and online mode. DoriTool provides nice visual reports including variant annotation, linkage disequilibrium proxies, gene annotation, gene ontology analysis, expression quantitative trait loci results from Genotype-Tissue Expression (GTEx) and coloured pathways. Here, we also show DoriTool functionalities by applying it to a dataset of 13 variants associated with prostate cancer. Project development, released code libraries, GitHub repository (https://github.com/doritool) and documentation are hosted at https://doritool.github.io/. DoriTool is, to our knowledge, the most complete bioinformatics tool offering functional in silico annotation of variants previously associated with a trait of interest, shedding light on the underlying biology and helping the researchers in the interpretation and discussion of the results.
Assuntos
Biologia Computacional/métodos , Genômica/métodos , Anotação de Sequência Molecular/métodos , Software , Simulação por Computador , Estudos de Associação Genética/métodos , Variação Genética/genética , Humanos , Fenótipo , Reprodutibilidade dos TestesRESUMO
BACKGROUND: Increasing evidence points to the role of tumor immunologic environment on urothelial bladder cancer prognosis. This effect might be partly dependent on the host genetic context. We evaluated the association of SNPs in inflammation-related genes with non-muscle-invasive bladder cancer (NMIBC) risk-of-recurrence and risk-of-progression. METHODS: We considered 822 NMIBC included in the SBC/EPICURO Study followed-up >10 years. We selected 1,679 SNPs belonging to 251 inflammatory genes. The association of SNPs with risk-of-recurrence and risk-of-progression was assessed using Cox regression single-marker (SMM) and multimarker methods (MMM) Bayes A and Bayesian LASSO. Discriminative abilities of the models were calculated using the c index and validated with bootstrap cross-validation procedures. RESULTS: While no SNP was found to be associated with risk-of-recurrence using SMM, three SNPs in TNIP1, CD5, and JAK3 showed very strong association with posterior probabilities >90% using MMM. Regarding risk-of-progression, one SNP in CD3G was significantly associated using SMM (HR, 2.69; P = 1.55 × 10(-5)) and two SNPs in MASP1 and AIRE, showed a posterior probability ≥80% with MMM. Validated discriminative abilities of the models without and with the SNPs were 58.4% versus 60.5% and 72.1% versus 72.8% for risk-of-recurrence and risk-of-progression, respectively. CONCLUSIONS: Using innovative analytic approaches, we demonstrated that SNPs in inflammatory-related genes were associated with NMIBC prognosis and that they improve the discriminative ability of prognostic clinical models for NMIBC. IMPACT: This study provides proof of concept for the joint effect of genetic variants in improving the discriminative ability of clinical prognostic models. The approach may be extended to other diseases. Cancer Epidemiol Biomarkers Prev; 25(7); 1144-50. ©2016 AACR.
Assuntos
Biomarcadores Tumorais/genética , Carcinoma de Células de Transição/genética , Neoplasias da Bexiga Urinária/genética , Idoso , Teorema de Bayes , Carcinoma de Células de Transição/patologia , Progressão da Doença , Feminino , Seguimentos , Predisposição Genética para Doença , Variação Genética , Humanos , Estimativa de Kaplan-Meier , Masculino , Pessoa de Meia-Idade , Recidiva Local de Neoplasia/genética , Polimorfismo de Nucleotídeo Único , Modelos de Riscos Proporcionais , Fatores de Risco , Neoplasias da Bexiga Urinária/patologiaRESUMO
INTRODUCTION: Germline variants in TP63 have been consistently associated with several tumors, including bladder cancer, indicating the importance of TP53 pathway in cancer genetic susceptibility. However, variants in other related genes, including TP53 rs1042522 (Arg72Pro), still present controversial results. We carried out an in depth assessment of associations between common germline variants in the TP53 pathway and bladder cancer risk. MATERIAL AND METHODS: We investigated 184 tagSNPs from 18 genes in 1,058 cases and 1,138 controls from the Spanish Bladder Cancer/EPICURO Study. Cases were newly-diagnosed bladder cancer patients during 1998-2001. Hospital controls were age-gender, and area matched to cases. SNPs were genotyped in blood DNA using Illumina Golden Gate and TaqMan assays. Cases were subphenotyped according to stage/grade and tumor p53 expression. We applied classical tests to assess individual SNP associations and the Least Absolute Shrinkage and Selection Operator (LASSO)-penalized logistic regression analysis to assess multiple SNPs simultaneously. RESULTS: Based on classical analyses, SNPs in BAK1 (1), IGF1R (5), P53AIP1 (1), PMAIP1 (2), SERINPB5 (3), TP63 (3), and TP73 (1) showed significant associations at p-value≤0.05. However, no evidence of association, either with overall risk or with specific disease subtypes, was observed after correction for multiple testing (p-value≥0.8). LASSO selected the SNP rs6567355 in SERPINB5 with 83% of reproducibility. This SNP provided an ORâ=â1.21, 95%CI 1.05-1.38, p-valueâ=â0.006, and a corrected p-valueâ=â0.5 when controlling for over-estimation. DISCUSSION: We found no strong evidence that common variants in the TP53 pathway are associated with bladder cancer susceptibility. Our study suggests that it is unlikely that TP53 Arg72Pro is implicated in the UCB in white Europeans. SERPINB5 and TP63 variation deserve further exploration in extended studies.