Pesquisa | Portal Regional da BVS

1.

Advancing LGBTQ+ inclusion in STEM education and AI research.

Wong, Emily; Urbanowicz, Ryan J; Bright, Tiffani J; Tatonetti, Nicholas P; Hsiao, Yi-Wen; Huang, Xiuzhen; Moore, Jason H; Peng, Pei-Chen.

Patterns (N Y) ; 5(6): 101010, 2024 Jun 14.

Artigo em Inglês | MEDLINE | ID: mdl-39005486

RESUMO

The authors emphasize diversity, equity, and inclusion in STEM education and artificial intelligence (AI) research, focusing on LGBTQ+ representation. They discuss the challenges faced by queer scientists, educational resources, the implementation of National AI Campus, and the notion of intersectionality. The authors hope to ensure supportive and respectful engagement across all communities.

2.

Distinct Network Patterns Emerge from Cartesian and XOR Epistasis Models: A Comparative Network Science Analysis.

Sha, Zhendong; Freda, Philip J; Bhandary, Priyanka; Ghosh, Attri; Matsumoto, Nicholas; Moore, Jason H; Hu, Ting.

Res Sq ; 2024 May 23.

Artigo em Inglês | MEDLINE | ID: mdl-38826481

RESUMO

Background: Epistasis, the phenomenon where the effect of one gene (or variant) is masked or modified by one or more other genes, can significantly contribute to the observed phenotypic variance of complex traits. To date, it has been generally assumed that genetic interactions can be detected using a Cartesian, or multiplicative, interaction model commonly utilized in standard regression approaches. However, a recent study investigating epistasis in obesity-related traits in rats and mice has identified potential limitations of the Cartesian model, revealing that it only detects some of the genetic interactions occurring in these systems. By applying an alternative approach, the exclusive-or (XOR) model, the researchers detected a greater number of epistatic interactions and identified more biologically relevant ontological terms associated with the interacting loci. This suggests that the XOR model may provide a more comprehensive understanding of epistasis in these species and phenotypes. To further explore these findings and determine if different interaction models also make up distinct epistatic networks, we leverage network science to provide a more comprehensive view into the genetic interactions underlying BMI in this system. Results: Our comparative analysis of networks derived from Cartesian and XOR interaction models in rats (Rattus norvegicus) uncovers distinct topological characteristics for each model-derived network. Notably, we discover that networks based on the XOR model exhibit an enhanced sensitivity to epistatic interactions. This sensitivity enables the identification of network communities, revealing novel trait-related biological functions through enrichment analysis. Furthermore, we identify triangle network motifs in the XOR epistatic network, suggestive of higher-order epistasis, based on the topology of lower-order epistasis. Conclusions: These findings highlight the XOR model's ability to uncover meaningful biological associations as well as higher-order epistasis from lower-order epistatic networks. Additionally, our results demonstrate that network approaches not only enhance epistasis detection capabilities but also provide more nuanced understandings of genetic architectures underlying complex traits. The identification of community structures and motifs within these distinct networks, especially in XOR, points to the potential for network science to aid in the discovery of novel genetic pathways and regulatory networks. Such insights are important for advancing our understanding of phenotype-genotype relationships.

3.

Sex classification of 3D skull images using deep neural networks.

Noel, Lake; Fat, Shelby Chun; Causey, Jason L; Dong, Wei; Stubblefield, Jonathan; Szymanski, Kathryn; Chang, Jui-Hsuan; Wang, Paul Zhiping; Moore, Jason H; Ray, Edward; Huang, Xiuzhen.

Sci Rep ; 14(1): 13707, 2024 06 14.

Artigo em Inglês | MEDLINE | ID: mdl-38877045

RESUMO

Determining the fundamental characteristics that define a face as "feminine" or "masculine" has long fascinated anatomists and plastic surgeons, particularly those involved in aesthetic and gender-affirming surgery. Previous studies in this area have relied on manual measurements, comparative anatomy, and heuristic landmark-based feature extraction. In this study, we collected retrospectively at Cedars Sinai Medical Center (CSMC) a dataset of 98 skull samples, which is the first dataset of this kind of 3D medical imaging. We then evaluated the accuracy of multiple deep learning neural network architectures on sex classification with this dataset. Specifically, we evaluated methods representing three different 3D data modeling approaches: Resnet3D, PointNet++, and MeshNet. Despite the limited number of imaging samples, our testing results show that all three approaches achieve AUC scores above 0.9 after convergence. PointNet++ exhibits the highest accuracy, while MeshNet has the lowest. Our findings suggest that accuracy is not solely dependent on the sparsity of data representation but also on the architecture design, with MeshNet's lower accuracy likely due to the lack of a hierarchical structure for progressive data abstraction. Furthermore, we studied a problem related to sex determination, which is the analysis of the various morphological features that affect sex classification. We proposed and developed a new method based on morphological gradients to visualize features that influence model decision making. The method based on morphological gradients is an alternative to the standard saliency map, and the new method provides better visualization of feature importance. Our study is the first to develop and evaluate deep learning models for analyzing 3D facial skull images to identify imaging feature differences between individuals assigned male or female at birth. These findings may be useful for planning and evaluating craniofacial surgery, particularly gender-affirming procedures, such as facial feminization surgery.

Assuntos

Aprendizado Profundo , Imageamento Tridimensional , Redes Neurais de Computação , Crânio , Humanos , Crânio/anatomia & histologia , Crânio/diagnóstico por imagem , Imageamento Tridimensional/métodos , Feminino , Masculino , Estudos Retrospectivos , Caracteres Sexuais , Adulto , Processamento de Imagem Assistida por Computador/métodos

4.

Using GPT-4 to write a scientific review article: a pilot evaluation study.

Wang, Zhiping Paul; Bhandary, Priyanka; Wang, Yizhou; Moore, Jason H.

BioData Min ; 17(1): 16, 2024 Jun 18.

Artigo em Inglês | MEDLINE | ID: mdl-38890715

RESUMO

GPT-4, as the most advanced version of OpenAI's large language models, has attracted widespread attention, rapidly becoming an indispensable AI tool across various areas. This includes its exploration by scientists for diverse applications. Our study focused on assessing GPT-4's capabilities in generating text, tables, and diagrams for biomedical review papers. We also assessed the consistency in text generation by GPT-4, along with potential plagiarism issues when employing this model for the composition of scientific review papers. Based on the results, we suggest the development of enhanced functionalities in ChatGPT, aiming to meet the needs of the scientific community more effectively. This includes enhancements in uploaded document processing for reference materials, a deeper grasp of intricate biomedical concepts, more precise and efficient information distillation for table generation, and a further refined model specifically tailored for scientific diagram creation.

5.

KRAGEN: a knowledge graph-enhanced RAG framework for biomedical problem solving using large language models.

Matsumoto, Nicholas; Moran, Jay; Choi, Hyunjun; Hernandez, Miguel E; Venkatesan, Mythreye; Wang, Paul; Moore, Jason H.

Bioinformatics ; 40(6)2024 Jun 03.

Artigo em Inglês | MEDLINE | ID: mdl-38830083

RESUMO

MOTIVATION: Answering and solving complex problems using a large language model (LLM) given a certain domain such as biomedicine is a challenging task that requires both factual consistency and logic, and LLMs often suffer from some major limitations, such as hallucinating false or irrelevant information, or being influenced by noisy data. These issues can compromise the trustworthiness, accuracy, and compliance of LLM-generated text and insights. RESULTS: Knowledge Retrieval Augmented Generation ENgine (KRAGEN) is a new tool that combines knowledge graphs, Retrieval Augmented Generation (RAG), and advanced prompting techniques to solve complex problems with natural language. KRAGEN converts knowledge graphs into a vector database and uses RAG to retrieve relevant facts from it. KRAGEN uses advanced prompting techniques: namely graph-of-thoughts (GoT), to dynamically break down a complex problem into smaller subproblems, and proceeds to solve each subproblem by using the relevant knowledge through the RAG framework, which limits the hallucinations, and finally, consolidates the subproblems and provides a solution. KRAGEN's graph visualization allows the user to interact with and evaluate the quality of the solution's GoT structure and logic. AVAILABILITY AND IMPLEMENTATION: KRAGEN is deployed by running its custom Docker containers. KRAGEN is available as open-source from GitHub at: https://github.com/EpistasisLab/KRAGEN.

Assuntos

Software , Processamento de Linguagem Natural , Resolução de Problemas , Algoritmos , Armazenamento e Recuperação da Informação/métodos , Humanos , Biologia Computacional/métodos , Bases de Dados Factuais

6.

PFERM: A Fair Empirical Risk Minimization Approach with Prior Knowledge.

Hou, Bojian; Mondragón, Andrés; Tarzanagh, Davoud Ataee; Zhou, Zhuoping; Saykin, Andrew J; Moore, Jason H; Ritchie, Marylyn D; Long, Qi; Shen, Li.

AMIA Jt Summits Transl Sci Proc ; 2024: 211-220, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38827072

RESUMO

Fairness is crucial in machine learning to prevent bias based on sensitive attributes in classifier predictions. However, the pursuit of strict fairness often sacrifices accuracy, particularly when significant prevalence disparities exist among groups, making classifiers less practical. For example, Alzheimer's disease (AD) is more prevalent in women than men, making equal treatment inequitable for females. Accounting for prevalence ratios among groups is essential for fair decision-making. In this paper, we introduce prior knowledge for fairness, which incorporates prevalence ratio information into the fairness constraint within the Empirical Risk Minimization (ERM) framework. We develop the Prior-knowledge-guided Fair ERM (PFERM) framework, aiming to minimize expected risk within a specified function class while adhering to a prior-knowledge-guided fairness constraint. This approach strikes a flexible balance between accuracy and fairness. Empirical results confirm its effectiveness in preserving fairness without compromising accuracy.

7.

Interpretable deep clustering survival machines for Alzheimer's disease subtype discovery.

Hou, Bojian; Wen, Zixuan; Bao, Jingxuan; Zhang, Richard; Tong, Boning; Yang, Shu; Wen, Junhao; Cui, Yuhan; Moore, Jason H; Saykin, Andrew J; Huang, Heng; Thompson, Paul M; Ritchie, Marylyn D; Davatzikos, Christos; Shen, Li.

Med Image Anal ; 97: 103231, 2024 Jun 14.

Artigo em Inglês | MEDLINE | ID: mdl-38941858

RESUMO

Alzheimer's disease (AD) is a complex neurodegenerative disorder that has impacted millions of people worldwide. The neuroanatomical heterogeneity of AD has made it challenging to fully understand the disease mechanism. Identifying AD subtypes during the prodromal stage and determining their genetic basis would be immensely valuable for drug discovery and subsequent clinical treatment. Previous studies that clustered subgroups typically used unsupervised learning techniques, neglecting the survival information and potentially limiting the insights gained. To address this problem, we propose an interpretable survival analysis method called Deep Clustering Survival Machines (DCSM), which combines both discriminative and generative mechanisms. Similar to mixture models, we assume that the timing information of survival data can be generatively described by a mixture of parametric distributions, referred to as expert distributions. We learn the weights of these expert distributions for individual instances in a discriminative manner by leveraging their features. This allows us to characterize the survival information of each instance through a weighted combination of the learned expert distributions. We demonstrate the superiority of the DCSM method by applying this approach to cluster patients with mild cognitive impairment (MCI) into subgroups with different risks of converting to AD. Conventional clustering measurements for survival analysis along with genetic association studies successfully validate the effectiveness of the proposed method and characterize our clustering findings.

8.

Centralized and Federated Models for the Analysis of Clinical Data.

Li, Ruowang; Romano, Joseph D; Chen, Yong; Moore, Jason H.

Annu Rev Biomed Data Sci ; 2024 May 09.

Artigo em Inglês | MEDLINE | ID: mdl-38723657

RESUMO

The progress of precision medicine research hinges on the gathering and analysis of extensive and diverse clinical datasets. With the continued expansion of modalities, scales, and sources of clinical datasets, it becomes imperative to devise methods for aggregating information from these varied sources to achieve a comprehensive understanding of diseases. In this review, we describe two important approaches for the analysis of diverse clinical datasets, namely the centralized model and federated model. We compare and contrast the strengths and weaknesses inherent in each model and present recent progress in methodologies and their associated challenges. Finally, we present an outlook on the opportunities that both models hold for the future analysis of clinical data.

9.

The Alzheimer's Knowledge Base: A Knowledge Graph for Alzheimer Disease Research.

Romano, Joseph D; Truong, Van; Kumar, Rachit; Venkatesan, Mythreye; Graham, Britney E; Hao, Yun; Matsumoto, Nick; Li, Xi; Wang, Zhiping; Ritchie, Marylyn D; Shen, Li; Moore, Jason H.

J Med Internet Res ; 26: e46777, 2024 Apr 18.

Artigo em Inglês | MEDLINE | ID: mdl-38635981

RESUMO

BACKGROUND: As global populations age and become susceptible to neurodegenerative illnesses, new therapies for Alzheimer disease (AD) are urgently needed. Existing data resources for drug discovery and repurposing fail to capture relationships central to the disease's etiology and response to drugs. OBJECTIVE: We designed the Alzheimer's Knowledge Base (AlzKB) to alleviate this need by providing a comprehensive knowledge representation of AD etiology and candidate therapeutics. METHODS: We designed the AlzKB as a large, heterogeneous graph knowledge base assembled using 22 diverse external data sources describing biological and pharmaceutical entities at different levels of organization (eg, chemicals, genes, anatomy, and diseases). AlzKB uses a Web Ontology Language 2 ontology to enforce semantic consistency and allow for ontological inference. We provide a public version of AlzKB and allow users to run and modify local versions of the knowledge base. RESULTS: AlzKB is freely available on the web and currently contains 118,902 entities with 1,309,527 relationships between those entities. To demonstrate its value, we used graph data science and machine learning to (1) propose new therapeutic targets based on similarities of AD to Parkinson disease and (2) repurpose existing drugs that may treat AD. For each use case, AlzKB recovers known therapeutic associations while proposing biologically plausible new ones. CONCLUSIONS: AlzKB is a new, publicly available knowledge resource that enables researchers to discover complex translational associations for AD drug discovery. Through 2 use cases, we show that it is a valuable tool for proposing novel therapeutic hypotheses based on public biomedical knowledge.

Assuntos

Doença de Alzheimer , Humanos , Doença de Alzheimer/tratamento farmacológico , Doença de Alzheimer/genética , Reconhecimento Automatizado de Padrão , Bases de Conhecimento , Aprendizado de Máquina , Conhecimento

10.

AI-luminating Artificial Intelligence in Inflammatory Bowel Diseases: A Narrative Review on the Role of AI in Endoscopy, Histology, and Imaging for IBD.

Gu, Phillip; Mendonca, Oreen; Carter, Dan; Dube, Shishir; Wang, Paul; Huang, Xiuzhen; Li, Debiao; Moore, Jason H; McGovern, Dermot P B.

Inflamm Bowel Dis ; 2024 Mar 07.

Artigo em Inglês | MEDLINE | ID: mdl-38452040

RESUMO

Endoscopy, histology, and cross-sectional imaging serve as fundamental pillars in the detection, monitoring, and prognostication of inflammatory bowel disease (IBD). However, interpretation of these studies often relies on subjective human judgment, which can lead to delays, intra- and interobserver variability, and potential diagnostic discrepancies. With the rising incidence of IBD globally coupled with the exponential digitization of these data, there is a growing demand for innovative approaches to streamline diagnosis and elevate clinical decision-making. In this context, artificial intelligence (AI) technologies emerge as a timely solution to address the evolving challenges in IBD. Early studies using deep learning and radiomics approaches for endoscopy, histology, and imaging in IBD have demonstrated promising results for using AI to detect, diagnose, characterize, phenotype, and prognosticate IBD. Nonetheless, the available literature has inherent limitations and knowledge gaps that need to be addressed before AI can transition into a mainstream clinical tool for IBD. To better understand the potential value of integrating AI in IBD, we review the available literature to summarize our current understanding and identify gaps in knowledge to inform future investigations.

11.

Interaction models matter: an efficient, flexible computational framework for model-specific investigation of epistasis.

Batista, Sandra; Madar, Vered Senderovich; Freda, Philip J; Bhandary, Priyanka; Ghosh, Attri; Matsumoto, Nicholas; Chitre, Apurva S; Palmer, Abraham A; Moore, Jason H.

BioData Min ; 17(1): 7, 2024 Feb 28.

Artigo em Inglês | MEDLINE | ID: mdl-38419006

RESUMO

PURPOSE: Epistasis, the interaction between two or more genes, is integral to the study of genetics and is present throughout nature. Yet, it is seldom fully explored as most approaches primarily focus on single-locus effects, partly because analyzing all pairwise and higher-order interactions requires significant computational resources. Furthermore, existing methods for epistasis detection only consider a Cartesian (multiplicative) model for interaction terms. This is likely limiting as epistatic interactions can evolve to produce varied relationships between genetic loci, some complex and not linearly separable. METHODS: We present new algorithms for the interaction coefficients for standard regression models for epistasis that permit many varied models for the interaction terms for loci and efficient memory usage. The algorithms are given for two-way and three-way epistasis and may be generalized to higher order epistasis. Statistical tests for the interaction coefficients are also provided. We also present an efficient matrix based algorithm for permutation testing for two-way epistasis. We offer a proof and experimental evidence that methods that look for epistasis only at loci that have main effects may not be justified. Given the computational efficiency of the algorithm, we applied the method to a rat data set and mouse data set, with at least 10,000 loci and 1,000 samples each, using the standard Cartesian model and the XOR model to explore body mass index. RESULTS: This study reveals that although many of the loci found to exhibit significant statistical epistasis overlap between models in rats, the pairs are mostly distinct. Further, the XOR model found greater evidence for statistical epistasis in many more pairs of loci in both data sets with almost all significant epistasis in mice identified using XOR. In the rat data set, loci involved in epistasis under the XOR model are enriched for biologically relevant pathways. CONCLUSION: Our results in both species show that many biologically relevant epistatic relationships would have been undetected if only one interaction model was applied, providing evidence that varied interaction models should be implemented to explore epistatic interactions that occur in living systems.

12.

Artificial intelligence and technology collaboratories: Empowering innovation in AI + AgeTech.

Li, Rose M; Abadir, Peter M; Battle, Alexis; Chellappa, Rama; Choudhry, Niteesh K; Demiris, George; Ganesan, Deepak; Karlawish, Jason; Moore, Jason H; Walston, Jeremy D.

J Am Geriatr Soc ; 72(5): 1602-1604, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38407353

Assuntos

Inteligência Artificial , Humanos , Geriatria

13.

Artificial Intelligence and Technology Collaboratories: Innovating aging research and Alzheimer's care.

Abadir, Peter; Oh, Esther; Chellappa, Rama; Choudhry, Niteesh; Demiris, George; Ganesan, Deepak; Karlawish, Jason; Marlin, Benjamin; Li, Rose M; Dehak, Najim; Arbaje, Alicia; Unberath, Mathias; Cudjoe, Thomas; Chute, Christopher; Moore, Jason H; Phan, Phillip; Samus, Quincy; Schoenborn, Nancy L; Battle, Alexis; Walston, Jeremy D.

Alzheimers Dement ; 20(4): 3074-3079, 2024 04.

Artigo em Inglês | MEDLINE | ID: mdl-38324244

RESUMO

This perspective outlines the Artificial Intelligence and Technology Collaboratories (AITC) at Johns Hopkins University, University of Pennsylvania, and University of Massachusetts, highlighting their roles in developing AI-based technologies for older adult care, particularly targeting Alzheimer's disease (AD). These National Institute on Aging (NIA) centers foster collaboration among clinicians, gerontologists, ethicists, business professionals, and engineers to create AI solutions. Key activities include identifying technology needs, stakeholder engagement, training, mentoring, data integration, and navigating ethical challenges. The objective is to apply these innovations effectively in real-world scenarios, including in rural settings. In addition, the AITC focuses on developing best practices for AI application in the care of older adults, facilitating pilot studies, and addressing ethical concerns related to technology development for older adults with cognitive impairment, with the ultimate aim of improving the lives of older adults and their caregivers. HIGHLIGHTS: Addressing the complex needs of older adults with Alzheimer's disease (AD) requires a comprehensive approach, integrating medical and social support. Current gaps in training, techniques, tools, and expertise hinder uniform access across communities and health care settings. Artificial intelligence (AI) and digital technologies hold promise in transforming care for this demographic. Yet, transitioning these innovations from concept to marketable products presents significant challenges, often stalling promising advancements in the developmental phase. The Artificial Intelligence and Technology Collaboratories (AITC) program, funded by the National Institute on Aging (NIA), presents a viable model. These Collaboratories foster the development and implementation of AI methods and technologies through projects aimed at improving care for older Americans, particularly those with AD, and promote the sharing of best practices in AI and technology integration. Why Does This Matter? The National Institute on Aging (NIA) Artificial Intelligence and Technology Collaboratories (AITC) program's mission is to accelerate the adoption of artificial intelligence (AI) and new technologies for the betterment of older adults, especially those with dementia. By bridging scientific and technological expertise, fostering clinical and industry partnerships, and enhancing the sharing of best practices, this program can significantly improve the health and quality of life for older adults with Alzheimer's disease (AD).

Assuntos

Doença de Alzheimer , Isotiocianatos , Estados Unidos , Humanos , Idoso , Doença de Alzheimer/terapia , Inteligência Artificial , Gerociência , Qualidade de Vida , Tecnologia

14.

Artificial intelligence: revolutionizing cardiology with large language models.

Boonstra, Machteld J; Weissenbacher, Davy; Moore, Jason H; Gonzalez-Hernandez, Graciela; Asselbergs, Folkert W.

Eur Heart J ; 45(5): 332-345, 2024 Feb 01.

Artigo em Inglês | MEDLINE | ID: mdl-38170821

RESUMO

Natural language processing techniques are having an increasing impact on clinical care from patient, clinician, administrator, and research perspective. Among others are automated generation of clinical notes and discharge letters, medical term coding for billing, medical chatbots both for patients and clinicians, data enrichment in the identification of disease symptoms or diagnosis, cohort selection for clinical trial, and auditing purposes. In the review, an overview of the history in natural language processing techniques developed with brief technical background is presented. Subsequently, the review will discuss implementation strategies of natural language processing tools, thereby specifically focusing on large language models, and conclude with future opportunities in the application of such techniques in the field of cardiology.

Assuntos

Inteligência Artificial , Cardiologia , Humanos , Processamento de Linguagem Natural , Alta do Paciente

15.

The Molecular Twin artificial-intelligence platform integrates multi-omic data to predict outcomes for pancreatic adenocarcinoma patients.

Osipov, Arsen; Nikolic, Ognjen; Gertych, Arkadiusz; Parker, Sarah; Hendifar, Andrew; Singh, Pranav; Filippova, Darya; Dagliyan, Grant; Ferrone, Cristina R; Zheng, Lei; Moore, Jason H; Tourtellotte, Warren; Van Eyk, Jennifer E; Theodorescu, Dan.

Nat Cancer ; 5(2): 299-314, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-38253803

RESUMO

Contemporary analyses focused on a limited number of clinical and molecular biomarkers have been unable to accurately predict clinical outcomes in pancreatic ductal adenocarcinoma. Here we describe a precision medicine platform known as the Molecular Twin consisting of advanced machine-learning models and use it to analyze a dataset of 6,363 clinical and multi-omic molecular features from patients with resected pancreatic ductal adenocarcinoma to accurately predict disease survival (DS). We show that a full multi-omic model predicts DS with the highest accuracy and that plasma protein is the top single-omic predictor of DS. A parsimonious model learning only 589 multi-omic features demonstrated similar predictive performance as the full multi-omic model. Our platform enables discovery of parsimonious biomarker panels and performance assessment of outcome prediction models learning from resource-intensive panels. This approach has considerable potential to impact clinical care and democratize precision cancer medicine worldwide.

Assuntos

Adenocarcinoma , Carcinoma Ductal Pancreático , Neoplasias Pancreáticas , Humanos , Adenocarcinoma/genética , Adenocarcinoma/cirurgia , Neoplasias Pancreáticas/genética , Neoplasias Pancreáticas/cirurgia , Multiômica , Inteligência Artificial , Carcinoma Ductal Pancreático/genética , Carcinoma Ductal Pancreático/cirurgia , Inteligência

16.

mixWAS: An efficient distributed algorithm for mixed-outcomes genome-wide association studies.

Li, Ruowang; Benz, Luke; Duan, Rui; Denny, Joshua C; Hakonarson, Hakon; Mosley, Jonathan D; Smoller, Jordan W; Wei, Wei-Qi; Ritchie, Marylyn D; Moore, Jason H; Chen, Yong.

medRxiv ; 2024 Jan 10.

Artigo em Inglês | MEDLINE | ID: mdl-38260403

RESUMO

Genome-wide association studies (GWAS) have been instrumental in identifying genetic associations for various diseases and traits. However, uncovering genetic underpinnings among traits beyond univariate phenotype associations remains a challenge. Multi-phenotype associations (MPA), or genetic pleiotropy, offer important insights into shared genes and pathways among traits, enhancing our understanding of genetic architectures of complex diseases. GWAS of biobank-linked electronic health record (EHR) data are increasingly being utilized to identify MPA among various traits and diseases. However, methodologies that can efficiently take advantage of distributed EHR to detect MPA are still lacking. Here, we introduce mixWAS, a novel algorithm that efficiently and losslessly integrates multiple EHRs via summary statistics, allowing the detection of MPA among mixed phenotypes while accounting for heterogeneities across EHRs. Simulations demonstrate that mixWAS outperforms the widely used MPA detection method, Phenome-wide association study (PheWAS), across diverse scenarios. Applying mixWAS to data from seven EHRs in the US, we identified 4,534 MPA among blood lipids, BMI, and circulatory diseases. Validation in an independent EHR data from UK confirmed 97.7% of the associations. mixWAS fundamentally improves the detection of MPA and is available as a free, open-source software.

17.

Cluster Analysis reveals Socioeconomic Disparities among Elective Spine Surgery Patients.

Orlenko, Alena; Freda, Philip J; Ghosh, Attri; Choi, Hyunjun; Matsumoto, Nicholas; Bright, Tiffani J; Walker, Corey T; Obafemi-Ajayi, Tayo; Moore, Jason H.

Pac Symp Biocomput ; 29: 359-373, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38160292

RESUMO

This work demonstrates the use of cluster analysis in detecting fair and unbiased novel discoveries. Given a sample population of elective spinal fusion patients, we identify two overarching subgroups driven by insurance type. The Medicare group, associated with lower socioeconomic status, exhibited an over-representation of negative risk factors. The findings provide a compelling depiction of the interwoven socioeconomic and racial disparities present within the healthcare system, highlighting their consequential effects on health inequalities. The results are intended to guide design of fair and precise machine learning models based on intentional integration of population stratification.

Assuntos

Medicare , Disparidades Socioeconômicas em Saúde , Idoso , Humanos , Estados Unidos , Biologia Computacional , Grupos Raciais , Análise por Conglomerados

18.

Risk prediction: Methods, Challenges, and Opportunities.

Li, Ruowang; Duan, Rui; He, Lifang; Moore, Jason H.

Pac Symp Biocomput ; 29: 650-653, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38160314

RESUMO

The following sections are included:Introduction to the workshopWorkshop Presenters.

19.

SynTwin: A graph-based approach for predicting clinical outcomes using digital twins derived from synthetic patients.

Moore, Jason H; Li, Xi; Chang, Jui-Hsuan; Tatonetti, Nicholas P; Theodorescu, Dan; Chen, Yong; Asselbergs, Folkert W; Venkatesan, Mythreye; Wang, Zhiping Paul.

Pac Symp Biocomput ; 29: 96-107, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38160272

RESUMO

The concept of a digital twin came from the engineering, industrial, and manufacturing domains to create virtual objects or machines that could inform the design and development of real objects. This idea is appealing for precision medicine where digital twins of patients could help inform healthcare decisions. We have developed a methodology for generating and using digital twins for clinical outcome prediction. We introduce a new approach that combines synthetic data and network science to create digital twins (i.e. SynTwin) for precision medicine. First, our approach starts by estimating the distance between all subjects based on their available features. Second, the distances are used to construct a network with subjects as nodes and edges defining distance less than the percolation threshold. Third, communities or cliques of subjects are defined. Fourth, a large population of synthetic patients are generated using a synthetic data generation algorithm that models the correlation structure of the data to generate new patients. Fifth, digital twins are selected from the synthetic patient population that are within a given distance defining a subject community in the network. Finally, we compare and contrast community-based prediction of clinical endpoints using real subjects, digital twins, or both within and outside of the community. Key to this approach are the digital twins defined using patient similarity that represent hypothetical unobserved patients with patterns similar to nearby real patients as defined by network distance and community structure. We apply our SynTwin approach to predicting mortality in a population-based cancer registry (n=87,674) from the Surveillance, Epidemiology, and End Results (SEER) program from the National Cancer Institute (USA). Our results demonstrate that nearest network neighbor prediction of mortality in this study is significantly improved with digital twins (AUROC=0.864, 95% CI=0.857-0.872) over just using real data alone (AUROC=0.791, 95% CI=0.781-0.800). These results suggest a network-based digital twin strategy using synthetic patients may add value to precision medicine efforts.

Assuntos

Algoritmos , Biologia Computacional , Humanos , Análise por Conglomerados , Medicina de Precisão

20.

Ten simple rules for managing laboratory information.

Berezin, Casey-Tyler; Aguilera, Luis U; Billerbeck, Sonja; Bourne, Philip E; Densmore, Douglas; Freemont, Paul; Gorochowski, Thomas E; Hernandez, Sarah I; Hillson, Nathan J; King, Connor R; Köpke, Michael; Ma, Shuyi; Miller, Katie M; Moon, Tae Seok; Moore, Jason H; Munsky, Brian; Myers, Chris J; Nicholas, Dequina A; Peccoud, Samuel J; Zhou, Wen; Peccoud, Jean.

PLoS Comput Biol ; 19(12): e1011652, 2023 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-38060459

RESUMO

Information is the cornerstone of research, from experimental (meta)data and computational processes to complex inventories of reagents and equipment. These 10 simple rules discuss best practices for leveraging laboratory information management systems to transform this large information load into useful scientific findings.

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA