Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
Add more filters










Publication year range
1.
Bioinformatics ; 38(Suppl 1): i350-i358, 2022 06 24.
Article in English | MEDLINE | ID: mdl-35758817

ABSTRACT

MOTIVATION: Estimating causal queries, such as changes in protein abundance in response to a perturbation, is a fundamental task in the analysis of biomolecular pathways. The estimation requires experimental measurements on the pathway components. However, in practice many pathway components are left unobserved (latent) because they are either unknown, or difficult to measure. Latent variable models (LVMs) are well-suited for such estimation. Unfortunately, LVM-based estimation of causal queries can be inaccurate when parameters of the latent variables are not uniquely identified, or when the number of latent variables is misspecified. This has limited the use of LVMs for causal inference in biomolecular pathways. RESULTS: In this article, we propose a general and practical approach for LVM-based estimation of causal queries. We prove that, despite the challenges above, LVM-based estimators of causal queries are accurate if the queries are identifiable according to Pearl's do-calculus and describe an algorithm for its estimation. We illustrate the breadth and the practical utility of this approach for estimating causal queries in four synthetic and two experimental case studies, where structures of biomolecular pathways challenge the existing methods for causal query estimation. AVAILABILITY AND IMPLEMENTATION: The code and the data documenting all the case studies are available at https://github.com/srtaheri/LVMwithDoCalculus. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Algorithms , Calculi , Humans , Models, Theoretical , Proteins
2.
AMIA Jt Summits Transl Sci Proc ; 2021: 257-266, 2021.
Article in English | MEDLINE | ID: mdl-34457140

ABSTRACT

There is a controversy in the diagnosis and treatment of hypothyroidism. We propose the disagreement is fueled by statistical paradoxes, and sampling biases that provide different perspectives depending upon the sample selection criteria. The statistical inconsistencies become more apparent when viewed using a causal lens. Foundational hypothyroid research does not reflect the current Levothyroxine treated population. Exploration of empirical data demonstrates an apparent breakdown of the T4 to T3 causal pathway in the treated population. This use case demonstrates the difficulty of translating controlled research into clinical practices for patients with multiple comorbid conditions. We make the case for redundancy in data collection, ongoing attempts to falsify current assumptions and the need for causal approaches to validate the results of controlled research in clinical settings, in order to avoid confirmation bias from statistically insufficient biometrics.


Subject(s)
Hypothyroidism , Triiodothyronine , Data Collection , Humans , Hypothyroidism/diagnosis , Hypothyroidism/drug therapy , Thyroxine
3.
IEEE Trans Big Data ; 7(1): 25-37, 2021 Mar 01.
Article in English | MEDLINE | ID: mdl-37981991

ABSTRACT

Counterfactual inference is a useful tool for comparing outcomes of interventions on complex systems. It requires us to represent the system in form of a structural causal model, complete with a causal diagram, probabilistic assumptions on exogenous variables, and functional assignments. Specifying such models can be extremely difficult in practice. The process requires substantial domain expertise, and does not scale easily to large systems, multiple systems, or novel system modifications. At the same time, many application domains, such as molecular biology, are rich in structured causal knowledge that is qualitative in nature. This article proposes a general approach for querying a causal biological knowledge graph, and converting the qualitative result into a quantitative structural causal model that can learn from data to answer the question. We demonstrate the feasibility, accuracy and versatility of this approach using two case studies in systems biology. The first demonstrates the appropriateness of the underlying assumptions and the accuracy of the results. The second demonstrates the versatility of the approach by querying a knowledge base for the molecular determinants of a severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)-induced cytokine storm, and performing counterfactual inference to estimate the causal effect of medical countermeasures for severely ill patients.

4.
Evol Appl ; 12(3): 384-398, 2019 Mar.
Article in English | MEDLINE | ID: mdl-30828362

ABSTRACT

Urban ecosystems are rapidly expanding throughout the world, but how urban growth affects the evolutionary ecology of species living in urban areas remains largely unknown. Urban ecology has advanced our understanding of how the development of cities and towns change environmental conditions and alter ecological processes and patterns. However, despite decades of research in urban ecology, the extent to which urbanization influences evolutionary and eco-evolutionary change has received little attention. The nascent field of urban evolutionary ecology seeks to understand how urbanization affects the evolution of populations, and how those evolutionary changes in turn influence the ecological dynamics of populations, communities, and ecosystems. Following a brief history of this emerging field, this Perspective article provides a research agenda and roadmap for future research aimed at advancing our understanding of the interplay between ecology and evolution of urban-dwelling organisms. We identify six key questions that, if addressed, would significantly increase our understanding of how urbanization influences evolutionary processes. These questions consider how urbanization affects nonadaptive evolution, natural selection, and convergent evolution, in addition to the role of urban environmental heterogeneity on species evolution, and the roles of phenotypic plasticity versus adaptation on species' abundance in cities. Our final question examines the impact of urbanization on evolutionary diversification. For each of these six questions, we suggest avenues for future research that will help advance the field of urban evolutionary ecology. Lastly, we highlight the importance of integrating urban evolutionary ecology into urban planning, conservation practice, pest management, and public engagement.

5.
J Comput Biol ; 25(7): 709-725, 2018 07.
Article in English | MEDLINE | ID: mdl-29927613

ABSTRACT

Machine learning methods for learning network structure are applied to quantitative proteomics experiments and reverse-engineer intracellular signal transduction networks. They provide insight into the rewiring of signaling within the context of a disease or a phenotype. To learn the causal patterns of influence between proteins in the network, the methods require experiments that include targeted interventions that fix the activity of specific proteins. However, the interventions are costly and add experimental complexity. We describe an active learning strategy for selecting optimal interventions. Our approach takes as inputs pathway databases and historic data sets, expresses them in form of prior probability distributions on network structures, and selects interventions that maximize their expected contribution to structure learning. Evaluations on simulated and real data show that the strategy reduces the detection error of validated edges as compared with an unguided choice of interventions and avoids redundant interventions, thereby increasing the effectiveness of the experiment.


Subject(s)
Bayes Theorem , Computational Biology/statistics & numerical data , Gene Expression Profiling/methods , Gene Regulatory Networks/genetics , Algorithms , Machine Learning , Models, Statistical , Signal Transduction
6.
Evolution ; 71(12): 2918-2929, 2017 Dec.
Article in English | MEDLINE | ID: mdl-28884790

ABSTRACT

Although all genetic variation ultimately stems from mutations, their properties are difficult to study directly. Here, we used multiple mutation accumulation (MA) lines derived from five genetic backgrounds of the green algae Chlamydomonas reinhardtii that have been previously subjected to whole genome sequencing to investigate the relationship between the number of spontaneous mutations and change in fitness from a nonevolved ancestor. MA lines were on average less fit than their ancestors and we detected a significantly negative correlation between the change in fitness and the total number of accumulated mutations in the genome. Likewise, the number of mutations located within coding regions significantly and negatively impacted MA line fitness. We used the fitness data to parameterize a maximum likelihood model to estimate discrete categories of mutational effects, and found that models containing one to two mutational effect categories (one neutral and one deleterious category) fitted the data best. However, the best-fitting mutational effects models were highly dependent on the genetic background of the ancestral strain.


Subject(s)
Chlamydomonas reinhardtii/genetics , Genetic Fitness , Mutation Accumulation , Chlamydomonas reinhardtii/growth & development , Chlamydomonas reinhardtii/physiology , Gene-Environment Interaction , Genetic Variation , Models, Genetic , Selection, Genetic , Stress, Physiological
7.
J Proteome Res ; 15(3): 683-90, 2016 Mar 04.
Article in English | MEDLINE | ID: mdl-26731284

ABSTRACT

Causal inference, the task of uncovering regulatory relationships between components of biomolecular pathways and networks, is a primary goal of many high-throughput investigations. Statistical associations between observed protein concentrations can suggest an enticing number of hypotheses regarding the underlying causal interactions, but when do such associations reflect the underlying causal biomolecular mechanisms? The goal of this perspective is to provide suggestions for causal inference in large-scale experiments, which utilize high-throughput technologies such as mass-spectrometry-based proteomics. We describe in nontechnical terms the pitfalls of inference in large data sets and suggest methods to overcome these pitfalls and reliably find regulatory associations.


Subject(s)
Causality , Computational Biology/methods , Gene Regulatory Networks , Models, Statistical , Animals , Humans , Metabolic Networks and Pathways , Systems Biology/methods , Systems Integration
10.
Am J Orthopsychiatry ; 50(2): 302-315, 1980 Apr.
Article in English | MEDLINE | ID: mdl-7361878

ABSTRACT

The relationship between religious activity within a Pentecostal congregation and the emotional status of the congregants is described. Data derived from a field study conducted in a Newfoundland coastal community. The more frequently people engaged in religious activities, the less likely they were to report symptoms of emotional distress. Significant within-group variation was found in terms of the frequency and type of religious activity.


Subject(s)
Mental Healing , Religion and Psychology , Anxiety/psychology , Female , Humans , Male , Newfoundland and Labrador , Psychophysiologic Disorders/psychology , Research , Social Adjustment
SELECTION OF CITATIONS
SEARCH DETAIL
...