Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 87
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Bioinformatics ; 39(1)2023 01 01.
Article in English | MEDLINE | ID: mdl-36579860

ABSTRACT

MOTIVATION: During disease progression or organism development, alternative splicing may lead to isoform switches that demonstrate similar temporal patterns and reflect the alternative splicing co-regulation of such genes. Tools for dynamic process analysis usually neglect alternative splicing. RESULTS: Here, we propose Spycone, a splicing-aware framework for time course data analysis. Spycone exploits a novel IS detection algorithm and offers downstream analysis such as network and gene set enrichment. We demonstrate the performance of Spycone using simulated and real-world data of SARS-CoV-2 infection. AVAILABILITY AND IMPLEMENTATION: The Spycone package is available as a PyPI package. The source code of Spycone is available under the GPLv3 license at https://github.com/yollct/spycone and the documentation at https://spycone.readthedocs.io/en/latest/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Alternative Splicing , COVID-19 , Humans , SARS-CoV-2/genetics , Software , Algorithms
2.
Cytometry A ; 105(3): 181-195, 2024 03.
Article in English | MEDLINE | ID: mdl-37984809

ABSTRACT

Multiparameter flow cytometry (MFC) has emerged as a standard method for quantifying measurable residual disease (MRD) in acute myeloid leukemia. However, the limited number of available channels on conventional flow cytometers requires the division of a diagnostic sample into several tubes, restricting the number of cells and the complexity of immunophenotypes that can be analyzed. Full spectrum flow cytometers overcome this limitation by enabling the simultaneous use of up to 40 fluorescent markers. Here, we used this approach to develop a good laboratory practice-conform single-tube 19-color MRD detection assay that complies with recommendations of the European LeukemiaNet Flow-MRD Working Party. We based our assay on clinically-validated antibody clones and evaluated its performance on an IVD-certified full spectrum flow cytometer. We measured MRD and normal bone marrow samples and compared the MRD data to a widely used reference MRD-MFC panel generating highly concordant results. Using our newly developed single-tube panel, we established reference values in healthy bone marrow for 28 consensus leukemia-associated immunophenotypes and introduced a semi-automated dimensionality-reduction, clustering and cell type identification approach that aids the unbiased detection of aberrant cells. In summary, we provide a comprehensive full spectrum MRD-MFC workflow with the potential for rapid implementation for routine diagnostics due to reduced cell requirements and ease of data analysis with increased reproducibility in comparison to conventional FlowMRD routines.


Subject(s)
Leukemia, Myeloid, Acute , Humans , Flow Cytometry/methods , Reproducibility of Results , Leukemia, Myeloid, Acute/diagnosis , Bone Marrow/metabolism , Neoplasm, Residual/diagnosis
3.
Bioinformatics ; 38(6): 1600-1606, 2022 03 04.
Article in English | MEDLINE | ID: mdl-34984440

ABSTRACT

MOTIVATION: Disease module mining methods (DMMMs) extract subgraphs that constitute candidate disease mechanisms from molecular interaction networks such as protein-protein interaction (PPI) networks. Irrespective of the employed models, DMMMs typically include non-robust steps in their workflows, i.e. the computed subnetworks vary when running the DMMMs multiple times on equivalent input. This lack of robustness has a negative effect on the trustworthiness of the obtained subnetworks and is hence detrimental for the widespread adoption of DMMMs in the biomedical sciences. RESULTS: To overcome this problem, we present a new DMMM called ROBUST (robust disease module mining via enumeration of diverse prize-collecting Steiner trees). In a large-scale empirical evaluation, we show that ROBUST outperforms competing methods in terms of robustness, scalability and, in most settings, functional relevance of the produced modules, measured via KEGG (Kyoto Encyclopedia of Genes and Genomes) gene set enrichment scores and overlap with DisGeNET disease genes. AVAILABILITY AND IMPLEMENTATION: A Python 3 implementation and scripts to reproduce the results reported in this article are available on GitHub: https://github.com/bionetslab/robust, https://github.com/bionetslab/robust-eval. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Algorithms , Trees , Computational Biology/methods , Protein Interaction Maps
4.
PLoS Biol ; 18(11): e3000885, 2020 11.
Article in English | MEDLINE | ID: mdl-33170835

ABSTRACT

Hypertension is the most important cause of death and disability in the elderly. In 9 out of 10 cases, the molecular cause, however, is unknown. One mechanistic hypothesis involves impaired endothelium-dependent vasodilation through reactive oxygen species (ROS) formation. Indeed, ROS forming NADPH oxidase (Nox) genes associate with hypertension, yet target validation has been negative. We re-investigate this association by molecular network analysis and identify NOX5, not present in rodents, as a sole neighbor to human vasodilatory endothelial nitric oxide (NO) signaling. In hypertensive patients, endothelial microparticles indeed contained higher levels of NOX5-but not NOX1, NOX2, or NOX4-with a bimodal distribution correlating with disease severity. Mechanistically, mice expressing human Nox5 in endothelial cells developed-upon aging-severe systolic hypertension and impaired endothelium-dependent vasodilation due to uncoupled NO synthase (NOS). We conclude that NOX5-induced uncoupling of endothelial NOS is a causal mechanism and theragnostic target of an age-related hypertension endotype. Nox5 knock-in (KI) mice represent the first mechanism-based animal model of hypertension.


Subject(s)
Hypertension/physiopathology , NADPH Oxidase 5/genetics , Nitric Oxide/metabolism , Adult , Age Factors , Aged , Animals , Endothelial Cells , Endothelium, Vascular , Female , Gene Knock-In Techniques/methods , Humans , Hypertension/genetics , Hypertension/metabolism , Male , Membrane Proteins/genetics , Mice , Middle Aged , NADPH Oxidase 5/metabolism , NADPH Oxidases/genetics , NADPH Oxidases/metabolism , Nitric Oxide/genetics , Nitric Oxide Synthase/genetics , Nitric Oxide Synthase/metabolism , Nitric Oxide Synthase Type III/genetics , Nitric Oxide Synthase Type III/metabolism , Reactive Oxygen Species
5.
Clin Chem Lab Med ; 61(2): 260-265, 2023 01 27.
Article in English | MEDLINE | ID: mdl-36321255

ABSTRACT

OBJECTIVES: Laboratory information systems typically contain hundreds or even thousands of reference limits stratified by sex and age. Since under these conditions a manual plausibility check is hardly feasible, we have developed a simple algorithm that facilitates this check. An open-source R tool is available as a Shiny application at github.com/SandraKla/Zlog_AdRI. METHODS: Based on the zlog standardization, we can possibly detect critical jumps at the transitions between age groups, regardless of the analytical method or the measuring unit. Its advantage compared to the standard z-value is that means and standard deviations are calculated from the reference limits rather than from the underlying data itself. The purpose of the tool is illustrated by the example of reference intervals of children and adolescents from the Canadian Laboratory Initiative on Pediatric Reference Intervals (CALIPER). RESULTS: The Shiny application identifies the zlog values, lists them in a colored table format and plots them additionally with the specified reference intervals. The algorithm detected several strong and rapid changes in reference intervals from the neonatal period to puberty. Remarkable jumps with absolute zlog values of more than five were seen for 29 out of 192 reference limits (15.1%). This might be attenuated by introducing shorter time periods or mathematical functions of reference limits over age. CONCLUSIONS: Age-partitioned reference intervals will remain the standard in laboratory routine for the foreseeable future, and as such, algorithmic approaches like our zlog approach in the presented Shiny application will remain valuable tools for testing their plausibility on a wide scale.


Subject(s)
Algorithms , Adolescent , Infant, Newborn , Child , Humans , Reference Values , Reference Standards , Canada
6.
Brain ; 145(6): 1992-2007, 2022 06 30.
Article in English | MEDLINE | ID: mdl-35511193

ABSTRACT

Cerebral small vessel disease is a leading cause of stroke and a major contributor to cognitive decline and dementia, but our understanding of specific genes underlying the cause of sporadic cerebral small vessel disease is limited. We report a genome-wide association study and a whole-exome association study on a composite extreme phenotype of cerebral small vessel disease derived from its most common MRI features: white matter hyperintensities and lacunes. Seventeen population-based cohorts of older persons with MRI measurements and genome-wide genotyping (n = 41 326), whole-exome sequencing (n = 15 965), or exome chip (n = 5249) data contributed 13 776 and 7079 extreme small vessel disease samples for the genome-wide association study and whole-exome association study, respectively. The genome-wide association study identified significant association of common variants in 11 loci with extreme small vessel disease, of which the chr12q24.11 locus was not previously reported to be associated with any MRI marker of cerebral small vessel disease. The whole-exome association study identified significant associations of extreme small vessel disease with common variants in the 5' UTR region of EFEMP1 (chr2p16.1) and one probably damaging common missense variant in TRIM47 (chr17q25.1). Mendelian randomization supports the causal association of extensive small vessel disease severity with increased risk of stroke and Alzheimer's disease. Combined evidence from summary-based Mendelian randomization studies and profiling of human loss-of-function allele carriers showed an inverse relation between TRIM47 expression in the brain and blood vessels and extensive small vessel disease severity. We observed significant enrichment of Trim47 in isolated brain vessel preparations compared to total brain fraction in mice, in line with the literature showing Trim47 enrichment in brain endothelial cells at single cell level. Functional evaluation of TRIM47 by small interfering RNAs-mediated knockdown in human brain endothelial cells showed increased endothelial permeability, an important hallmark of cerebral small vessel disease pathology. Overall, our comprehensive gene-mapping study and preliminary functional evaluation suggests a putative role of TRIM47 in the pathophysiology of cerebral small vessel disease, making it an important candidate for extensive in vivo explorations and future translational work.


Subject(s)
Brain Ischemia , Cerebral Small Vessel Diseases , Stroke , Animals , Brain Ischemia/complications , Cerebral Small Vessel Diseases/complications , Cerebral Small Vessel Diseases/diagnostic imaging , Cerebral Small Vessel Diseases/genetics , Endothelial Cells/pathology , Genome-Wide Association Study , Mice , Stroke/complications
7.
Nucleic Acids Res ; 49(D1): D309-D318, 2021 01 08.
Article in English | MEDLINE | ID: mdl-32976589

ABSTRACT

Alternative splicing plays a major role in regulating the functional repertoire of the proteome. However, isoform-specific effects to protein-protein interactions (PPIs) are usually overlooked, making it impossible to judge the functional role of individual exons on a systems biology level. We overcome this barrier by integrating protein-protein interactions, domain-domain interactions and residue-level interactions information to lift exon expression analysis to a network level. Our user-friendly database DIGGER is available at https://exbio.wzw.tum.de/digger and allows users to seamlessly switch between isoform and exon-centric views of the interactome and to extract sub-networks of relevant isoforms, making it an essential resource for studying mechanistic consequences of alternative splicing.


Subject(s)
Alternative Splicing , Databases, Protein , Exons , Protein Interaction Mapping/methods , Proteome/chemistry , RNA, Messenger/genetics , Binding Sites , Computational Biology/methods , Humans , Internet , Models, Molecular , Protein Binding , Protein Biosynthesis , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Interaction Domains and Motifs , Protein Isoforms , Proteome/genetics , Proteome/metabolism , RNA, Messenger/metabolism , Software , Thermodynamics
8.
BMC Med Ethics ; 24(1): 48, 2023 07 06.
Article in English | MEDLINE | ID: mdl-37415172

ABSTRACT

BACKGROUND: Healthcare providers have to make ethically complex clinical decisions which may be a source of stress. Researchers have recently introduced Artificial Intelligence (AI)-based applications to assist in clinical ethical decision-making. However, the use of such tools is controversial. This review aims to provide a comprehensive overview of the reasons given in the academic literature for and against their use. METHODS: PubMed, Web of Science, Philpapers.org and Google Scholar were searched for all relevant publications. The resulting set of publications was title and abstract screened according to defined inclusion and exclusion criteria, resulting in 44 papers whose full texts were analysed using the Kuckartz method of qualitative text analysis. RESULTS: Artificial Intelligence might increase patient autonomy by improving the accuracy of predictions and allowing patients to receive their preferred treatment. It is thought to increase beneficence by providing reliable information, thereby, supporting surrogate decision-making. Some authors fear that reducing ethical decision-making to statistical correlations may limit autonomy. Others argue that AI may not be able to replicate the process of ethical deliberation because it lacks human characteristics. Concerns have been raised about issues of justice, as AI may replicate existing biases in the decision-making process. CONCLUSIONS: The prospective benefits of using AI in clinical ethical decision-making are manifold, but its development and use should be undertaken carefully to avoid ethical pitfalls. Several issues that are central to the discussion of Clinical Decision Support Systems, such as justice, explicability or human-machine interaction, have been neglected in the debate on AI for clinical ethics so far. TRIAL REGISTRATION: This review is registered at Open Science Framework ( https://osf.io/wvcs9 ).


Subject(s)
Artificial Intelligence , Clinical Decision-Making , Humans , Beneficence
9.
J Med Internet Res ; 25: e42621, 2023 07 12.
Article in English | MEDLINE | ID: mdl-37436815

ABSTRACT

BACKGROUND: Machine learning and artificial intelligence have shown promising results in many areas and are driven by the increasing amount of available data. However, these data are often distributed across different institutions and cannot be easily shared owing to strict privacy regulations. Federated learning (FL) allows the training of distributed machine learning models without sharing sensitive data. In addition, the implementation is time-consuming and requires advanced programming skills and complex technical infrastructures. OBJECTIVE: Various tools and frameworks have been developed to simplify the development of FL algorithms and provide the necessary technical infrastructure. Although there are many high-quality frameworks, most focus only on a single application case or method. To our knowledge, there are no generic frameworks, meaning that the existing solutions are restricted to a particular type of algorithm or application field. Furthermore, most of these frameworks provide an application programming interface that needs programming knowledge. There is no collection of ready-to-use FL algorithms that are extendable and allow users (eg, researchers) without programming knowledge to apply FL. A central FL platform for both FL algorithm developers and users does not exist. This study aimed to address this gap and make FL available to everyone by developing FeatureCloud, an all-in-one platform for FL in biomedicine and beyond. METHODS: The FeatureCloud platform consists of 3 main components: a global frontend, a global backend, and a local controller. Our platform uses a Docker to separate the local acting components of the platform from the sensitive data systems. We evaluated our platform using 4 different algorithms on 5 data sets for both accuracy and runtime. RESULTS: FeatureCloud removes the complexity of distributed systems for developers and end users by providing a comprehensive platform for executing multi-institutional FL analyses and implementing FL algorithms. Through its integrated artificial intelligence store, federated algorithms can easily be published and reused by the community. To secure sensitive raw data, FeatureCloud supports privacy-enhancing technologies to secure the shared local models and assures high standards in data privacy to comply with the strict General Data Protection Regulation. Our evaluation shows that applications developed in FeatureCloud can produce highly similar results compared with centralized approaches and scale well for an increasing number of participating sites. CONCLUSIONS: FeatureCloud provides a ready-to-use platform that integrates the development and execution of FL algorithms while reducing the complexity to a minimum and removing the hurdles of federated infrastructure. Thus, we believe that it has the potential to greatly increase the accessibility of privacy-preserving and distributed data analyses in biomedicine and beyond.


Subject(s)
Algorithms , Artificial Intelligence , Humans , Health Occupations , Software , Computer Communication Networks , Privacy
10.
Bioinformatics ; 37(12): 1708-1716, 2021 07 19.
Article in English | MEDLINE | ID: mdl-33252645

ABSTRACT

MOTIVATION: Recently, various tools for detecting single nucleotide polymorphisms (SNPs) involved in epistasis have been developed. However, no studies evaluate the employed statistical epistasis models such as the χ2-test or quadratic regression independently of the tools that use them. Such an independent evaluation is crucial for developing improved epistasis detection tools, for it allows to decide if a tool's performance should be attributed to the epistasis model or to the optimization strategy run on top of it. RESULTS: We present a protocol for evaluating epistasis models independently of the tools they are used in and generalize existing models designed for dichotomous phenotypes to the categorical and quantitative case. In addition, we propose a new model which scores candidate SNP sets by computing maximum likelihood distributions for the observed phenotypes in the cells of their penetrance tables. Extensive experiments show that the proposed maximum likelihood model outperforms three widely used epistasis models in most cases. The experiments also provide valuable insights into the properties of existing models, for instance, that quadratic regression perform particularly well on instances with quantitative phenotypes. AVAILABILITY AND IMPLEMENTATION: The evaluation protocol and all compared models are implemented in C++ and are supported under Linux and macOS. They are available at https://github.com/baumbachlab/genepiseeker/, along with test datasets and scripts to reproduce the experiments. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Epistasis, Genetic , Polymorphism, Single Nucleotide , Phenotype , Probability
11.
Bioinformatics ; 37(18): 3008-3010, 2021 09 29.
Article in English | MEDLINE | ID: mdl-33647976

ABSTRACT

SUMMARY: A plethora of tools exist for RNA-Seq data analysis with a focus on alternative splicing (AS). However, appropriate data for their comparative evaluation is missing. The R package ASimulatoR simulates gold standard RNA-Seq datasets with fine-grained control over the distribution of AS events, which allow for evaluating alternative splicing tools, e.g. to study the effect of sequencing depth on the performance of AS event detection. AVAILABILITY AND IMPLEMENTATION: ASimulatoR is freely available at https://github.com/biomedbigdata/ASimulatoR as an R package under GPL-3 license.


Subject(s)
Alternative Splicing , Software , RNA-Seq , Sequence Analysis, RNA , Computer Simulation
12.
Bioinformatics ; 37(16): 2398-2404, 2021 Aug 25.
Article in English | MEDLINE | ID: mdl-33367514

ABSTRACT

MOTIVATION: Unsupervised learning approaches are frequently used to stratify patients into clinically relevant subgroups and to identify biomarkers such as disease-associated genes. However, clustering and biclustering techniques are oblivious to the functional relationship of genes and are thus not ideally suited to pinpoint molecular mechanisms along with patient subgroups. RESULTS: We developed the network-constrained biclustering approach Biclustering Constrained by Networks (BiCoN) which (i) restricts biclusters to functionally related genes connected in molecular interaction networks and (ii) maximizes the difference in gene expression between two subgroups of patients. This allows BiCoN to simultaneously pinpoint molecular mechanisms responsible for the patient grouping. Network-constrained clustering of genes makes BiCoN more robust to noise and batch effects than typical clustering and biclustering methods. BiCoN can faithfully reproduce known disease subtypes as well as novel, clinically relevant patient subgroups, as we could demonstrate using breast and lung cancer datasets. In summary, BiCoN is a novel systems medicine tool that combines several heuristic optimization strategies for robust disease mechanism extraction. BiCoN is well-documented and freely available as a python package or a web interface. AVAILABILITY AND IMPLEMENTATION: PyPI package: https://pypi.org/project/bicon. WEB INTERFACE: https://exbio.wzw.tum.de/bicon. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

13.
Gut ; 70(3): 522-530, 2021 03.
Article in English | MEDLINE | ID: mdl-33168600

ABSTRACT

OBJECTIVE: The intestinal microbiome affects the prevalence and pathophysiology of a variety of diseases ranging from inflammation to cancer. A reduced taxonomic or functional diversity of the microbiome was often observed in association with poorer health outcomes or disease in general. Conversely, factors or manifest diseases that determine the long-term stability or instability of the microbiome are largely unknown. We aimed to identify disease-relevant phenotypes associated with faecal microbiota (in-)stability. DESIGN: A total of 2564 paired faecal samples from 1282 participants of the population-based Study of Health in Pomerania (SHIP) were collected at a 5-year (median) interval and microbiota profiles determined by 16S rRNA gene sequencing. The changes in faecal microbiota over time were associated with highly standardised and comprehensive phenotypic data to determine factors related to microbiota (in-)stability. RESULTS: The overall microbiome landscape remained remarkably stable over time. The greatest microbiome instability was associated with factors contributing to metabolic syndrome such as fatty liver disease and diabetes mellitus. These, in turn, were associated with an increase in facultative pathogens such as Enterobacteriaceae or Escherichia/Shigella. Greatest stability of the microbiome was determined by higher initial alpha diversity, female sex, high household income and preserved exocrine pancreatic function. Participants who newly developed fatty liver disease or diabetes during the 5-year follow-up already displayed significant microbiota changes at study entry when the diseases were absent. CONCLUSION: This study identifies distinct components of metabolic liver disease to be associated with instability of the intestinal microbiome, increased abundance of facultative pathogens and thus greater susceptibility toward dysbiosis-associated diseases.


Subject(s)
Diabetes Mellitus/metabolism , Dysbiosis/complications , Exocrine Pancreatic Insufficiency/physiopathology , Gastrointestinal Microbiome , Liver Diseases/metabolism , Adult , Aged , Biodiversity , Feces/microbiology , Female , Gastrointestinal Microbiome/genetics , Germany , Humans , Income/statistics & numerical data , Longitudinal Studies , Male , Middle Aged , Phenotype , RNA, Ribosomal, 16S/analysis , Risk Factors , Sex Factors
14.
Am J Hum Genet ; 103(5): 691-706, 2018 11 01.
Article in English | MEDLINE | ID: mdl-30388399

ABSTRACT

C-reactive protein (CRP) is a sensitive biomarker of chronic low-grade inflammation and is associated with multiple complex diseases. The genetic determinants of chronic inflammation remain largely unknown, and the causal role of CRP in several clinical outcomes is debated. We performed two genome-wide association studies (GWASs), on HapMap and 1000 Genomes imputed data, of circulating amounts of CRP by using data from 88 studies comprising 204,402 European individuals. Additionally, we performed in silico functional analyses and Mendelian randomization analyses with several clinical outcomes. The GWAS meta-analyses of CRP revealed 58 distinct genetic loci (p < 5 × 10-8). After adjustment for body mass index in the regression analysis, the associations at all except three loci remained. The lead variants at the distinct loci explained up to 7.0% of the variance in circulating amounts of CRP. We identified 66 gene sets that were organized in two substantially correlated clusters, one mainly composed of immune pathways and the other characterized by metabolic pathways in the liver. Mendelian randomization analyses revealed a causal protective effect of CRP on schizophrenia and a risk-increasing effect on bipolar disorder. Our findings provide further insights into the biology of inflammation and could lead to interventions for treating inflammation and its clinical consequences.


Subject(s)
Genetic Loci/genetics , Inflammation/genetics , Metabolic Networks and Pathways/genetics , Adolescent , Adult , Aged , Aged, 80 and over , Biomarkers/metabolism , Bipolar Disorder/genetics , Bipolar Disorder/metabolism , Body Mass Index , C-Reactive Protein/genetics , Child , Female , Genome-Wide Association Study/methods , Humans , Inflammation/metabolism , Liver/metabolism , Liver/pathology , Male , Mendelian Randomization Analysis/methods , Middle Aged , Schizophrenia/genetics , Schizophrenia/metabolism , Young Adult
15.
Bioinformatics ; 36(19): 4957-4959, 2020 12 08.
Article in English | MEDLINE | ID: mdl-32289146

ABSTRACT

SUMMARY: Simulated data are crucial for evaluating epistasis detection tools in genome-wide association studies. Existing simulators are limited, as they do not account for linkage disequilibrium (LD), support limited interaction models of single nucleotide polymorphisms (SNPs) and only dichotomous phenotypes or depend on proprietary software. In contrast, EpiGEN supports SNP interactions of arbitrary order, produces realistic LD patterns and generates both categorical and quantitative phenotypes. AVAILABILITY AND IMPLEMENTATION: EpiGEN is implemented in Python 3 and is freely available at https://github.com/baumbachlab/epigen. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Epistasis, Genetic , Genome-Wide Association Study , Computer Simulation , Epigen , Polymorphism, Single Nucleotide , Software
16.
PLoS Comput Biol ; 16(2): e1007616, 2020 02.
Article in English | MEDLINE | ID: mdl-32012148

ABSTRACT

Genome-wide association studies (GWAS) identify genetic variants associated with traits or diseases. GWAS never directly link variants to regulatory mechanisms. Instead, the functional annotation of variants is typically inferred by post hoc analyses. A specific class of deep learning-based methods allows for the prediction of regulatory effects per variant on several cell type-specific chromatin features. We here describe "DeepWAS", a new approach that integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Thereby, single variants associated with a trait or disease are directly coupled to their impact on a chromatin feature in a cell type. Up to 61 regulatory SNPs, called dSNPs, were associated with multiple sclerosis (MS, 4,888 cases and 10,395 controls), major depressive disorder (MDD, 1,475 cases and 2,144 controls), and height (5,974 individuals). These variants were mainly non-coding and reached at least nominal significance in classical GWAS. The prediction accuracy was higher for DeepWAS than for classical GWAS models for 91% of the genome-wide significant, MS-specific dSNPs. DSNPs were enriched in public or cohort-matched expression and methylation quantitative trait loci and we demonstrated the potential of DeepWAS to generate testable functional hypotheses based on genotype data alone. DeepWAS is available at https://github.com/cellmapslab/DeepWAS.


Subject(s)
Deep Learning , Genetic Association Studies , Multivariate Analysis , Genome-Wide Association Study , Humans , Polymorphism, Single Nucleotide , Quantitative Trait Loci
17.
Nature ; 523(7561): 459-462, 2015 Jul 23.
Article in English | MEDLINE | ID: mdl-26131930

ABSTRACT

Homozygosity has long been associated with rare, often devastating, Mendelian disorders, and Darwin was one of the first to recognize that inbreeding reduces evolutionary fitness. However, the effect of the more distant parental relatedness that is common in modern human populations is less well understood. Genomic data now allow us to investigate the effects of homozygosity on traits of public health importance by observing contiguous homozygous segments (runs of homozygosity), which are inferred to be homozygous along their complete length. Given the low levels of genome-wide homozygosity prevalent in most human populations, information is required on very large numbers of people to provide sufficient power. Here we use runs of homozygosity to study 16 health-related quantitative traits in 354,224 individuals from 102 cohorts, and find statistically significant associations between summed runs of homozygosity and four complex traits: height, forced expiratory lung volume in one second, general cognitive ability and educational attainment (P < 1 × 10(-300), 2.1 × 10(-6), 2.5 × 10(-10) and 1.8 × 10(-10), respectively). In each case, increased homozygosity was associated with decreased trait value, equivalent to the offspring of first cousins being 1.2 cm shorter and having 10 months' less education. Similar effect sizes were found across four continental groups and populations with different degrees of genome-wide homozygosity, providing evidence that homozygosity, rather than confounding, directly contributes to phenotypic variance. Contrary to earlier reports in substantially smaller samples, no evidence was seen of an influence of genome-wide homozygosity on blood pressure and low density lipoprotein cholesterol, or ten other cardio-metabolic traits. Since directional dominance is predicted for traits under directional evolutionary selection, this study provides evidence that increased stature and cognitive function have been positively selected in human evolution, whereas many important risk factors for late-onset complex diseases may not have been.


Subject(s)
Body Height/genetics , Cognition , Homozygote , Biological Evolution , Blood Pressure/genetics , Cholesterol, LDL/genetics , Cohort Studies , Educational Status , Female , Forced Expiratory Volume/genetics , Genome, Human/genetics , Humans , Lung Volume Measurements , Male , Phenotype
18.
Gastroenterology ; 156(4): 1010-1015, 2019 03.
Article in English | MEDLINE | ID: mdl-30391469

ABSTRACT

BACKGROUND & AIMS: Changes in intestinal microbiome composition are associated with inflammatory, metabolic, and malignant disorders. We studied how exocrine pancreatic function affects intestinal microbiota. METHODS: We performed 16S ribosomal RNA gene sequencing analysis of stool samples from 1795 volunteers from the population-based Study of Health in Pomerania who had no history of pancreatic disease. We also measured fecal pancreatic elastase by enzyme-linked immunosorbent assay and performed quantitative imaging of secretin-stimulated pancreatic fluid secretion. Associations of exocrine pancreatic function with microbial diversity or individual genera were calculated by permutational analysis of variance or linear regression, respectively. RESULTS: Differences in pancreatic elastase levels associated with significantly (P < .0001) greater changes in microbiota diversity than with participant age, body mass index, sex, smoking, alcohol consumption, or dietary factors. Significant changes in the abundance of 30 taxa, such as an increase in Prevotella (q < .0001) and a decrease of Bacteroides (q < .0001), indicated a shift from a type-1 to a type-2 enterotype. Changes in pancreatic fluid secretion alone were also associated with changes in microbial diversity (P = .0002), although to a lesser degree. CONCLUSIONS: In an analysis of fecal samples from 1795 volunteers, pancreatic acinar cell, rather than duct cell, function is presently the single most significant host factor to be associated with changes in intestinal microbiota composition.


Subject(s)
Bacteria/isolation & purification , Exocrine Pancreatic Insufficiency/physiopathology , Feces/enzymology , Gastrointestinal Microbiome , Pancreas/physiopathology , Pancreatic Elastase/metabolism , Acinar Cells/physiology , Bacteroides/isolation & purification , Biodiversity , Host Microbial Interactions , Humans , Pancreas/cytology , Pancreatic Function Tests , Prevotella/isolation & purification , RNA, Ribosomal, 16S/analysis
19.
Brief Bioinform ; 19(1): 77-88, 2018 01 01.
Article in English | MEDLINE | ID: mdl-27742665

ABSTRACT

Systems biology models are rapidly increasing in complexity, size and numbers. When building large models, researchers rely on software tools for the retrieval, comparison, combination and merging of models, as well as for version control. These tools need to be able to quantify the differences and similarities between computational models. However, depending on the specific application, the notion of 'similarity' may greatly vary. A general notion of model similarity, applicable to various types of models, is still missing. Here we survey existing methods for the comparison of models, introduce quantitative measures for model similarity, and discuss potential applications of combined similarity measures. To frame model comparison as a general problem, we describe a theoretical approach to defining and computing similarities based on a combination of different model aspects. The six aspects that we define as potentially relevant for similarity are underlying encoding, references to biological entities, quantitative behaviour, qualitative behaviour, mathematical equations and parameters and network structure. We argue that future similarity measures will benefit from combining these model aspects in flexible, problem-specific ways to mimic users' intuition about model similarity, and to support complex model searches in databases.


Subject(s)
Computational Biology/methods , Computer Simulation , Models, Biological , Software , Systems Biology/methods , Animals , Databases, Factual , Humans , Signal Transduction , User-Computer Interface
20.
Am J Hum Genet ; 99(1): 40-55, 2016 Jul 07.
Article in English | MEDLINE | ID: mdl-27346686

ABSTRACT

Platelet production, maintenance, and clearance are tightly controlled processes indicative of platelets' important roles in hemostasis and thrombosis. Platelets are common targets for primary and secondary prevention of several conditions. They are monitored clinically by complete blood counts, specifically with measurements of platelet count (PLT) and mean platelet volume (MPV). Identifying genetic effects on PLT and MPV can provide mechanistic insights into platelet biology and their role in disease. Therefore, we formed the Blood Cell Consortium (BCX) to perform a large-scale meta-analysis of Exomechip association results for PLT and MPV in 157,293 and 57,617 individuals, respectively. Using the low-frequency/rare coding variant-enriched Exomechip genotyping array, we sought to identify genetic variants associated with PLT and MPV. In addition to confirming 47 known PLT and 20 known MPV associations, we identified 32 PLT and 18 MPV associations not previously observed in the literature across the allele frequency spectrum, including rare large effect (FCER1A), low-frequency (IQGAP2, MAP1A, LY75), and common (ZMIZ2, SMG6, PEAR1, ARFGAP3/PACSIN2) variants. Several variants associated with PLT/MPV (PEAR1, MRVI1, PTGES3) were also associated with platelet reactivity. In concurrent BCX analyses, there was overlap of platelet-associated variants with red (MAP1A, TMPRSS6, ZMIZ2) and white (PEAR1, ZMIZ2, LY75) blood cell traits, suggesting common regulatory pathways with shared genetic architecture among these hematopoietic lineages. Our large-scale Exomechip analyses identified previously undocumented associations with platelet traits and further indicate that several complex quantitative hematological, lipid, and cardiovascular traits share genetic factors.


Subject(s)
Blood Platelets/metabolism , Exome/genetics , Genetic Variation/genetics , Female , Genome-Wide Association Study , Humans , Male , Mean Platelet Volume , Platelet Count
SELECTION OF CITATIONS
SEARCH DETAIL