RESUMO
Mendelian randomization is a powerful method for inferring causal relationships. However, obtaining suitable genetic instrumental variables is often challenging due to gene interaction, linkage, and pleiotropy. We propose Bayesian network-based Mendelian randomization (BNMR), a Bayesian causal learning and inference framework using individual-level data. BNMR employs the random graph forest, an ensemble Bayesian network structural learning process, to prioritize candidate genetic variants and select appropriate instrumental variables, and then obtains a pleiotropy-robust estimate by incorporating a shrinkage prior in the Bayesian framework. Simulations demonstrate BNMR can efficiently reduce the false-positive discoveries in variant selection, and outperforms existing MR methods in terms of accuracy and statistical power in effect estimation. With application to the UK Biobank, BNMR exhibits its capacity in handling modern genomic data, and reveals the causal relationships from hematological traits to blood pressures and psychiatric disorders. Its effectiveness in handling complex genetic structures and modern genomic data highlights the potential to facilitate real-world evidence studies, making it a promising tool for advancing our understanding of causal mechanisms.
Assuntos
Teorema de Bayes , Análise da Randomização Mendeliana , Fenótipo , Humanos , Análise da Randomização Mendeliana/métodos , Variação Genética , Estudo de Associação Genômica Ampla/métodos , Modelos GenéticosRESUMO
With the development of genome-wide association studies, how to gain information from a large scale of data has become an issue of common concern, since traditional methods are not fully developed to solve problems such as identifying loci-to-loci interactions (also known as epistasis). Previous epistatic studies mainly focused on local information with a single outcome (phenotype), while in this paper, we developed a two-stage global search algorithm, Greedy Equivalence Search with Local Modification (GESLM), to implement a global search of directed acyclic graph in order to identify genome-wide epistatic interactions with multiple outcome variables (phenotypes) in a case-control design. GESLM integrates the advantages of score-based methods and constraint-based methods to learn the phenotype-related Bayesian network and is powerful and robust to find the interaction structures that display both genetic associations with phenotypes and gene interactions. We compared GESLM with some common phenotype-related loci detecting methods in simulation studies. The results showed that our method improved the accuracy and efficiency compared with others, especially in an unbalanced case-control study. Besides, its application on the UK Biobank dataset suggested that our algorithm has great performance when handling genome-wide association data with more than one phenotype.
Assuntos
Algoritmos , Estudo de Associação Genômica Ampla , Fenótipo , Polimorfismo de Nucleotídeo Único , Teorema de Bayes , Conjuntos de Dados como Assunto , HumanosRESUMO
Bayesian methods are widely used in the GWAS meta-analysis. But the considerable consumption in both computing time and memory space poses great challenges for large-scale meta-analyses. In this research, we propose an algorithm named SMetABF to rapidly obtain the optimal ABF in the GWAS meta-analysis, where shotgun stochastic search (SSS) is introduced to improve the Bayesian GWAS meta-analysis framework, MetABF. Simulation studies confirm that SMetABF performs well in both speed and accuracy, compared to exhaustive methods and MCMC. SMetABF is applied to real GWAS datasets to find several essential loci related to Parkinson's disease (PD) and the results support the underlying relationship between PD and other autoimmune disorders. Developed as an R package and a web tool, SMetABF will become a useful tool to integrate different studies and identify more variants associated with complex traits.
Assuntos
Algoritmos , Estudo de Associação Genômica Ampla , Teorema de Bayes , Simulação por Computador , Estudo de Associação Genômica Ampla/métodos , Metanálise como Assunto , Polimorfismo de Nucleotídeo ÚnicoRESUMO
The pathogenesis of Alzheimer disease (AD) involves complex gene regulatory changes across different cell types. To help decipher this complexity, we introduce single-cell Bayesian biclustering (scBC), a framework for identifying cell-specific gene network biomarkers in scRNA and snRNA-seq data. Through biclustering, scBC enables the analysis of perturbations in functional gene modules at the single-cell level. Applying the scBC framework to AD snRNA-seq data reveals the perturbations within gene modules across distinct cell groups and sheds light on gene-cell correlations during AD progression. Notably, our method helps to overcome common challenges in single-cell data analysis, including batch effects and dropout events. Incorporating prior knowledge further enables the framework to yield more biologically interpretable results. Comparative analyses on simulated and real-world datasets demonstrate the precision and robustness of our approach compared to other state-of-the-art biclustering methods. scBC holds potential for unraveling the mechanisms underlying polygenic diseases characterized by intricate gene coexpression patterns.
Assuntos
Doença de Alzheimer , Progressão da Doença , Análise de Célula Única , Transcriptoma , Humanos , Doença de Alzheimer/genética , Doença de Alzheimer/metabolismo , Doença de Alzheimer/patologia , Análise de Célula Única/métodos , Transcriptoma/genética , Análise por Conglomerados , Teorema de Bayes , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes/genéticaRESUMO
Investigation of the genetic basis of traits or clinical outcomes heavily relies on identifying relevant variables in molecular data. However, characteristics such as high dimensionality and complex correlation structures of these data hinder the development of related methods, resulting in the inclusion of false positives and negatives. We developed a variable importance measure method, termed the ECAR scores, that evaluates the importance of variables in the dataset. Based on this score, ranking and selection of variables can be achieved simultaneously. Unlike most current approaches, the ECAR scores aim to rank the influential variables as high as possible while maintaining the grouping property, instead of selecting the ones that are merely predictive. The ECAR scores' performance is tested and compared to other methods on simulated, semi-synthetic, and real datasets. Results showed that the ECAR scores improve the CAR scores in terms of accuracy of variable selection and high-rank variables' predictive power. It also outperforms other classic methods such as lasso and stability selection when there is a high degree of correlation among influential variables. As an application, we used the ECAR scores to analyze genes associated with forced expiratory volume in the first second in patients with lung cancer and reported six associated genes.
Assuntos
Biomarcadores Tumorais/metabolismo , Simulação por Computador , Volume Expiratório Forçado , Regulação Neoplásica da Expressão Gênica , Hordeum/metabolismo , Neoplasias Pulmonares/patologia , Proteínas de Plantas/metabolismo , Biomarcadores Tumorais/genética , Perfilação da Expressão Gênica , Hordeum/genética , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/metabolismo , Proteínas de Plantas/genéticaRESUMO
Heme oxygenase-1 (HO-1) has anti-inflammatory effects in asthma. CD4+CD25(high) regulatory T cells (Treg) are a potent immunoregulator that suppresses the immune response. We studied the effects of HO-1-mediated CD4+CD25(high) Treg on suppression of allergic airway inflammation by comparing mice treated with hemin, OVA, Sn-protoporphyrin (SnPP), and hemin plus SnPP. Airway responsiveness, airway eosinophil infiltration, the level of OVA-specific IgE, and the numbers of cells in general and eosinophils in particular in bronchial alveolar lavage fluid were lower in the hemin group than in the OVA, SnPP, and hemin plus SnPP groups. The expressions of HO-1 mRNA and protein in the lung were increased by repeated administrations of hemin and SnPP. However, the activity of HO-1 was highest in hemin mice. The percentage and suppressive function of CD4+CD25(high) Treg and the expression of Foxp3 mRNA were obviously enhanced after treatment with hemin. This increase was diminished by the administration of SnPP. The concentration of serum IL-10 was higher in the hemin group than in the other groups, whereas the level of serum TGF-beta did not significantly differ across groups. Furthermore, the ratio of IFN-gamma/IL-4 mRNA in the lung was higher in hemin-treated mice than in OVA and SnPP mice. The suppressive capacity of CD4+CD25(high) Treg was not enhanced in the IL-10-deficient mice treated with hemin. In conclusion, our experiments in the animal model demonstrated that HO-1 has anti-inflammatory effects, probably via enhancement of the secretion of IL-10 and promotion of the percentage of CD4+CD25(high) Treg.