Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Res Sq ; 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38826481

RESUMO

Background: Epistasis, the phenomenon where the effect of one gene (or variant) is masked or modified by one or more other genes, can significantly contribute to the observed phenotypic variance of complex traits. To date, it has been generally assumed that genetic interactions can be detected using a Cartesian, or multiplicative, interaction model commonly utilized in standard regression approaches. However, a recent study investigating epistasis in obesity-related traits in rats and mice has identified potential limitations of the Cartesian model, revealing that it only detects some of the genetic interactions occurring in these systems. By applying an alternative approach, the exclusive-or (XOR) model, the researchers detected a greater number of epistatic interactions and identified more biologically relevant ontological terms associated with the interacting loci. This suggests that the XOR model may provide a more comprehensive understanding of epistasis in these species and phenotypes. To further explore these findings and determine if different interaction models also make up distinct epistatic networks, we leverage network science to provide a more comprehensive view into the genetic interactions underlying BMI in this system. Results: Our comparative analysis of networks derived from Cartesian and XOR interaction models in rats (Rattus norvegicus) uncovers distinct topological characteristics for each model-derived network. Notably, we discover that networks based on the XOR model exhibit an enhanced sensitivity to epistatic interactions. This sensitivity enables the identification of network communities, revealing novel trait-related biological functions through enrichment analysis. Furthermore, we identify triangle network motifs in the XOR epistatic network, suggestive of higher-order epistasis, based on the topology of lower-order epistasis. Conclusions: These findings highlight the XOR model's ability to uncover meaningful biological associations as well as higher-order epistasis from lower-order epistatic networks. Additionally, our results demonstrate that network approaches not only enhance epistasis detection capabilities but also provide more nuanced understandings of genetic architectures underlying complex traits. The identification of community structures and motifs within these distinct networks, especially in XOR, points to the potential for network science to aid in the discovery of novel genetic pathways and regulatory networks. Such insights are important for advancing our understanding of phenotype-genotype relationships.

2.
BioData Min ; 17(1): 7, 2024 Feb 28.
Artigo em Inglês | MEDLINE | ID: mdl-38419006

RESUMO

PURPOSE: Epistasis, the interaction between two or more genes, is integral to the study of genetics and is present throughout nature. Yet, it is seldom fully explored as most approaches primarily focus on single-locus effects, partly because analyzing all pairwise and higher-order interactions requires significant computational resources. Furthermore, existing methods for epistasis detection only consider a Cartesian (multiplicative) model for interaction terms. This is likely limiting as epistatic interactions can evolve to produce varied relationships between genetic loci, some complex and not linearly separable. METHODS: We present new algorithms for the interaction coefficients for standard regression models for epistasis that permit many varied models for the interaction terms for loci and efficient memory usage. The algorithms are given for two-way and three-way epistasis and may be generalized to higher order epistasis. Statistical tests for the interaction coefficients are also provided. We also present an efficient matrix based algorithm for permutation testing for two-way epistasis. We offer a proof and experimental evidence that methods that look for epistasis only at loci that have main effects may not be justified. Given the computational efficiency of the algorithm, we applied the method to a rat data set and mouse data set, with at least 10,000 loci and 1,000 samples each, using the standard Cartesian model and the XOR model to explore body mass index. RESULTS: This study reveals that although many of the loci found to exhibit significant statistical epistasis overlap between models in rats, the pairs are mostly distinct. Further, the XOR model found greater evidence for statistical epistasis in many more pairs of loci in both data sets with almost all significant epistasis in mice identified using XOR. In the rat data set, loci involved in epistasis under the XOR model are enriched for biologically relevant pathways. CONCLUSION: Our results in both species show that many biologically relevant epistatic relationships would have been undetected if only one interaction model was applied, providing evidence that varied interaction models should be implemented to explore epistatic interactions that occur in living systems.

3.
medRxiv ; 2024 Feb 06.
Artigo em Inglês | MEDLINE | ID: mdl-38370703

RESUMO

Background: Social determinants of health (SDoH) like socioeconomics and neighborhoods strongly influence outcomes, yet standardized SDoH data is lacking in electronic health records (EHR), limiting research and care quality. Methods: We searched PubMed using keywords "SDOH" and "EHR", underwent title/abstract and full-text screening. Included records were analyzed under five domains: 1) SDoH screening and assessment approaches, 2) SDoH data collection and documentation, 3) Use of natural language processing (NLP) for extracting SDoH, 4) SDoH data and health outcomes, and 5) SDoH-driven interventions. Results: We identified 685 articles, of which 324 underwent full review. Key findings include tailored screening instruments implemented across settings, census and claims data linkage providing contextual SDoH profiles, rule-based and neural network systems extracting SDoH from notes using NLP, connections found between SDoH data and healthcare utilization/chronic disease control, and integrated care management programs executed. However, considerable variability persists across data sources, tools, and outcomes. Discussion: Despite progress identifying patient social needs, further development of standards, predictive models, and coordinated interventions is critical to fulfill the potential of SDoH-EHR integration. Additional database searches could strengthen this scoping review. Ultimately widespread capture, analysis, and translation of multidimensional SDoH data into clinical care is essential for promoting health equity.

4.
JMIR Ment Health ; 11: e53366, 2024 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-38224481

RESUMO

BACKGROUND: Information regarding opioid use disorder (OUD) status and severity is important for patient care. Clinical notes provide valuable information for detecting and characterizing problematic opioid use, necessitating development of natural language processing (NLP) tools, which in turn requires reliably labeled OUD-relevant text and understanding of documentation patterns. OBJECTIVE: To inform automated NLP methods, we aimed to develop and evaluate an annotation schema for characterizing OUD and its severity, and to document patterns of OUD-relevant information within clinical notes of heterogeneous patient cohorts. METHODS: We developed an annotation schema to characterize OUD severity based on criteria from the Diagnostic and Statistical Manual of Mental Disorders, 5th edition. In total, 2 annotators reviewed clinical notes from key encounters of 100 adult patients with varied evidence of OUD, including patients with and those without chronic pain, with and without medication treatment for OUD, and a control group. We completed annotations at the sentence level. We calculated severity scores based on annotation of note text with 18 classes aligned with criteria for OUD severity and determined positive predictive values for OUD severity. RESULTS: The annotation schema contained 27 classes. We annotated 1436 sentences from 82 patients; notes of 18 patients (11 of whom were controls) contained no relevant information. Interannotator agreement was above 70% for 11 of 15 batches of reviewed notes. Severity scores for control group patients were all 0. Among noncontrol patients, the mean severity score was 5.1 (SD 3.2), indicating moderate OUD, and the positive predictive value for detecting moderate or severe OUD was 0.71. Progress notes and notes from emergency department and outpatient settings contained the most and greatest diversity of information. Substance misuse and psychiatric classes were most prevalent and highly correlated across note types with high co-occurrence across patients. CONCLUSIONS: Implementation of the annotation schema demonstrated strong potential for inferring OUD severity based on key information in a small set of clinical notes and highlighting where such information is documented. These advancements will facilitate NLP tool development to improve OUD prevention, diagnosis, and treatment.


Assuntos
Dor Crônica , Transtornos Relacionados ao Uso de Opioides , Adulto , Humanos , Processamento de Linguagem Natural , Pacientes Ambulatoriais , Grupos Controle , Transtornos Relacionados ao Uso de Opioides/diagnóstico
5.
Pac Symp Biocomput ; 29: 359-373, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38160292

RESUMO

This work demonstrates the use of cluster analysis in detecting fair and unbiased novel discoveries. Given a sample population of elective spinal fusion patients, we identify two overarching subgroups driven by insurance type. The Medicare group, associated with lower socioeconomic status, exhibited an over-representation of negative risk factors. The findings provide a compelling depiction of the interwoven socioeconomic and racial disparities present within the healthcare system, highlighting their consequential effects on health inequalities. The results are intended to guide design of fair and precise machine learning models based on intentional integration of population stratification.


Assuntos
Medicare , Disparidades Socioeconômicas em Saúde , Idoso , Humanos , Estados Unidos , Biologia Computacional , Grupos Raciais , Análise por Conglomerados
6.
BioData Min ; 16(1): 14, 2023 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-37038201

RESUMO

BACKGROUND: Quantitative Trait Locus (QTL) analysis and Genome-Wide Association Studies (GWAS) have the power to identify variants that capture significant levels of phenotypic variance in complex traits. However, effort and time are required to select the best methods and optimize parameters and pre-processing steps. Although machine learning approaches have been shown to greatly assist in optimization and data processing, applying them to QTL analysis and GWAS is challenging due to the complexity of large, heterogenous datasets. Here, we describe proof-of-concept for an automated machine learning approach, AutoQTL, with the ability to automate many complicated decisions related to analysis of complex traits and generate solutions to describe relationships that exist in genetic data. RESULTS: Using a publicly available dataset of 18 putative QTL from a large-scale GWAS of body mass index in the laboratory rat, Rattus norvegicus, AutoQTL captures the phenotypic variance explained under a standard additive model. AutoQTL also detects evidence of non-additive effects including deviations from additivity and 2-way epistatic interactions in simulated data via multiple optimal solutions. Additionally, feature importance metrics provide different insights into the inheritance models and predictive power of multiple GWAS-derived putative QTL. CONCLUSIONS: This proof-of-concept illustrates that automated machine learning techniques can complement standard approaches and have the potential to detect both additive and non-additive effects via various optimal solutions and feature importance metrics. In the future, we aim to expand AutoQTL to accommodate omics-level datasets with intelligent feature selection and feature engineering strategies.

7.
bioRxiv ; 2023 Jan 13.
Artigo em Inglês | MEDLINE | ID: mdl-36711526

RESUMO

Background: Quantitative Trait Locus (QTL) analysis and Genome-Wide Association Studies (GWAS) have the power to identify variants that capture significant levels of phenotypic variance in complex traits. However, effort and time are required to select the best methods and optimize parameters and pre-processing steps. Although machine learning approaches have been shown to greatly assist in optimization and data processing, applying them to QTL analysis and GWAS is challenging due to the complexity of large, heterogenous datasets. Here, we describe proof-of-concept for an automated machine learning approach, AutoQTL, with the ability to automate many complex decisions related to analysis of complex traits and generate diverse solutions to describe relationships that exist in genetic data. Results: Using a dataset of 18 putative QTL from a large-scale GWAS of body mass index in the laboratory rat, Rattus norvegicus , AutoQTL captures the phenotypic variance explained under a standard additive model while also providing evidence of non-additive effects including deviations from additivity and 2-way epistatic interactions from simulated data via multiple optimal solutions. Additionally, feature importance metrics provide different insights into the inheritance models and predictive power of multiple GWAS-derived putative QTL. Conclusions: This proof-of-concept illustrates that automated machine learning techniques can be applied to genetic data and has the potential to detect both additive and non-additive effects via various optimal solutions and feature importance metrics. In the future, we aim to expand AutoQTL to accommodate omics-level datasets with intelligent feature selection strategies.

8.
BioData Min ; 15(1): 14, 2022 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-35840990

RESUMO

The opioid epidemic continues to contribute to loss of life through overdose and significant social and economic burdens. Many individuals who develop problematic opioid use (POU) do so after being exposed to prescribed opioid analgesics. Therefore, it is important to accurately identify and classify risk factors for POU. In this review, we discuss the etiology of POU and highlight novel approaches to identifying its risk factors. These approaches include the application of polygenic risk scores (PRS) and diverse machine learning (ML) algorithms used in tandem with data from electronic health records (EHR), clinical notes, patient demographics, and digital footprints. The implementation and synergy of these types of data and approaches can greatly assist in reducing the incidence of POU and opioid-related mortality by increasing the knowledge base of patient-related risk factors, which can help to improve prescribing practices for opioid analgesics.

9.
Front Public Health ; 10: 850619, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35615042

RESUMO

Background: Opioid use disorder (OUD) is underdiagnosed in health system settings, limiting research on OUD using electronic health records (EHRs). Medical encounter notes can enrich structured EHR data with documented signs and symptoms of OUD and social risks and behaviors. To capture this information at scale, natural language processing (NLP) tools must be developed and evaluated. We developed and applied an annotation schema to deeply characterize OUD and related clinical, behavioral, and environmental factors, and automated the annotation schema using machine learning and deep learning-based approaches. Methods: Using the MIMIC-III Critical Care Database, we queried hospital discharge summaries of patients with International Classification of Diseases (ICD-9) OUD diagnostic codes. We developed an annotation schema to characterize problematic opioid use, identify individuals with potential OUD, and provide psychosocial context. Two annotators reviewed discharge summaries from 100 patients. We randomly sampled patients with their associated annotated sentences and divided them into training (66 patients; 2,127 annotated sentences) and testing (29 patients; 1,149 annotated sentences) sets. We used the training set to generate features, employing three NLP algorithms/knowledge sources. We trained and tested prediction models for classification with a traditional machine learner (logistic regression) and deep learning approach (Autogluon based on ELECTRA's replaced token detection model). We applied a five-fold cross-validation approach to reduce bias in performance estimates. Results: The resulting annotation schema contained 32 classes. We achieved moderate inter-annotator agreement, with F1-scores across all classes increasing from 48 to 66%. Five classes had a sufficient number of annotations for automation; of these, we observed consistently high performance (F1-scores) across training and testing sets for drug screening (training: 91-96; testing: 91-94) and opioid type (training: 86-96; testing: 86-99). Performance dropped from training and to testing sets for other drug use (training: 52-65; testing: 40-48), pain management (training: 72-78; testing: 61-78) and psychiatric (training: 73-80; testing: 72). Autogluon achieved the highest performance. Conclusion: This pilot study demonstrated that rich information regarding problematic opioid use can be manually identified by annotators. However, more training samples and features would improve our ability to reliably identify less common classes from clinical text, including text from outpatient settings.


Assuntos
Processamento de Linguagem Natural , Transtornos Relacionados ao Uso de Opioides , Analgésicos Opioides , Hospitais , Humanos , Alta do Paciente , Projetos Piloto
10.
J Exp Biol ; 225(11)2022 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-35578907

RESUMO

Organisms with complex life cycles demonstrate a remarkable ability to change their phenotypes across development, presumably as an evolutionary adaptation to developmentally variable environments. Developmental variation in environmentally sensitive performance, and thermal sensitivity in particular, has been well documented in holometabolous insects. For example, thermal performance in adults and juvenile stages exhibit little genetic correlation (genetic decoupling) and can evolve independently, resulting in divergent thermal responses. Yet, we understand very little about how this genetic decoupling occurs. We tested the hypothesis that genetic decoupling of thermal physiology is driven by fundamental differences in physiology between life stages, despite a potentially conserved cellular stress response. We used RNAseq to compare transcript expression in response to a cold stressor in Drosophila melanogaster larvae and adults and used RNA interference (RNAi) to test whether knocking down nine target genes differentially affected larval and adult cold tolerance. Transcriptomic responses of whole larvae and adults during and following exposure to -5°C were largely unique both in identity of responding transcripts and in temporal dynamics. Further, we analyzed the tissue-specificity of differentially expressed transcripts from FlyAtlas 2 data, and concluded that stage-specific differences in transcription were not simply driven by differences in tissue composition. In addition, RNAi of target genes resulted in largely stage-specific and sometimes sex-specific effects on cold tolerance. The combined evidence suggests that thermal physiology is largely stage-specific at the level of gene expression, and thus natural selection may be acting on different loci during the independent thermal adaptation of different life stages.


Assuntos
Drosophila melanogaster , Transcriptoma , Animais , Drosophila melanogaster/genética , Feminino , Larva/genética , Estágios do Ciclo de Vida/genética , Masculino , Seleção Genética
11.
AMIA Annu Symp Proc ; 2022: 606-615, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-37128417

RESUMO

Our objective was to detect common barriers to post-acute care (B2PAC) among hospitalized older adults using natural language processing (NLP) of clinical notes from patients discharged home when a clinical decision support system recommended post-acute care. We annotated B2PAC sentences from discharge planning notes and developed an NLP classifier to identify the highest-value B2PAC class (negative patient preferences). Thirteen machine learning models were compared with Amazon's AutoGluon deep learning model. The study included 594 acute care notes from 100 patient encounters (1156 sentences contained 11 B2PAC) in a large academic health system. The most frequent and modifiable B2PAC class was negative patient preferences (18.3%). The best supervised model was Extreme Gradient Boosting (F1: 0.859), but the deep learning model performed better (F1: 0.916). Alerting clinicians of negative patient preferences early in the hospitalization can prompt interventions such as patient education to ensure patients receive the right level of care and avoid negative outcomes.


Assuntos
Processamento de Linguagem Natural , Preferência do Paciente , Humanos , Idoso , Cuidados Semi-Intensivos , Aprendizado de Máquina , Encaminhamento e Consulta , Registros Eletrônicos de Saúde
12.
Drug Alcohol Depend ; 221: 108602, 2021 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-33652377

RESUMO

Opioid use disorder (OUD) creates significant public health and economic burdens worldwide. Therefore, understanding the risk factors that lead to the development of OUD is fundamental to reducing both its prevalence and its impact. Significant sources of OUD risk include co-occurring lifetime and current diagnoses of both psychiatric disorders, primarily mood disorders, and other substance use disorders, and unique and shared genetic factors. Although there appears to be pleiotropy between OUD and both mood and substance use disorders, this aspect of OUD risk is poorly understood. In this review, we describe the prevalence and clinical significance of addictive and affective comorbidities as risk factors for OUD development as a basis for rational opioid prescribing and OUD treatment and to improve efforts to prevent the disorder. We also review the genetic variants that have been associated with OUD and other addictive and affective disorders to highlight targets for future study and risk assessment protocols.


Assuntos
Analgésicos Opioides/uso terapêutico , Transtornos do Humor/epidemiologia , Transtornos Relacionados ao Uso de Opioides/tratamento farmacológico , Comportamento Aditivo , Comorbidade , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Transtornos do Humor/genética , Transtornos Relacionados ao Uso de Opioides/genética , Fenômica , Padrões de Prática Médica , Prevalência , Fatores de Risco
13.
Heredity (Edinb) ; 123(4): 479-491, 2019 10.
Artigo em Inglês | MEDLINE | ID: mdl-31164731

RESUMO

Environments often vary across a life cycle, imposing fluctuating natural selection across development. Such fluctuating selection can favor different phenotypes in different life stages, but stage-specific evolutionary responses will depend on genetic variance, covariance, and their interaction across development and across environments. Thus, quantifying how genetic architecture varies with plastic responses to the environment and across development is vital to predict whether stage-specific adaptation will occur in nature. Additionally, the interaction of genetic variation and environmental plasticity (GxE) may be stage-specific, leading to a three-way interaction between genotype, environment, and development or GxDxE. To test for these patterns, we exposed larvae and adults of Drosophila melanogaster isogenic lines derived from a natural population to extreme heat and cold stress after developmental acclimation to cool (18 °C) and warm (25 °C) conditions and measured genetic variance for thermal hardiness. We detected significant GxE that was specific to larvae and adults for cold and heat hardiness (GxDxE), but no significant genetic correlation across development for either trait at either acclimation temperature. However, cross-development phenotypic correlations for acclimation responses suggest that plasticity itself may be developmentally constrained, though rigorously testing this hypothesis requires more experimentation. These results illustrate the potential for stage-specific adaptation within a complex life cycle and demonstrate the importance of measuring traits at appropriate developmental stages and environmental conditions when predicting evolutionary responses to changing climates.


Assuntos
Adaptação Fisiológica/genética , Drosophila melanogaster/genética , Estágios do Ciclo de Vida/genética , Seleção Genética/genética , Animais , Mudança Climática , Temperatura Baixa , Interação Gene-Ambiente , Variação Genética/genética , Genótipo , Temperatura Alta , Larva/genética , Temperatura
14.
Integr Comp Biol ; 57(5): 999-1009, 2017 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-29045669

RESUMO

As organisms age the environment fluctuates, exerting differential selection across ontogeny. In particular, highly seasonal environments expose life stages to often drastically different thermal environments. This developmental variation is particularly striking in organisms with complex life cycles, wherein life history stages also exhibit distinct morphologies, physiologies, and behaviors. Genes acting pleiotropically on thermal responses may produce genetic correlations across ontogeny, constraining the independent evolution of each life stage to their respective thermal environments. To investigate whether developmental genetic correlations constrain the evolution thermal hardiness of the fly Drosophila melanogaster, we applied quantitative genetic analyses to cold hardiness measured in both larvae and adults from isogenic lines of the Drosophila Genetic Reference Panel (DGRP), using survival at stressful low temperatures as the phenotypic metric. Using full genome resequencing data for the DGRP, we also implemented genome-wide association (GWA) analysis using Bayesian Sparse Linear Mixed Models (BSLMMs) to estimate associations between naturally segregating variation and cold hardiness for both larvae and adults. Quantitative genetic analyses revealed no significant genetic correlation for cold hardiness between life stages, suggesting complete genetic decoupling of thermal hardiness across the metamorphic boundary. Both quantitative genetic and GWA analyses suggested that polygenic variation underlies cold hardiness in both stages, and that associated loci largely affected one stage or the other, but not both. However, reciprocal enrichment tests and correlations between BSLMM parameters for each life stage support some shared physiological mechanisms that may reflect common cellular thermal response pathways. Overall, these results suggest no developmental genetic constraints on cold hardiness across metamorphosis in D. melanogaster, an important consideration in evolutionary models of responses to changing climates. Genetic correlations for environmental sensitivity across ontogeny remains largely unexplored in other organisms, thus assessing the generality of genetic decoupling will require further quantitative or population genetic analysis in additional species.


Assuntos
Aclimatação , Drosophila melanogaster/fisiologia , Metamorfose Biológica , Termotolerância , Animais , Teorema de Bayes , Temperatura Baixa , Drosophila melanogaster/genética , Drosophila melanogaster/crescimento & desenvolvimento , Feminino , Estudo de Associação Genômica Ampla , Larva/genética , Larva/crescimento & desenvolvimento , Larva/fisiologia , Masculino , Fenótipo , Estações do Ano
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA