Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 307
Filtrar
1.
Artículo en Inglés | MEDLINE | ID: mdl-38926131

RESUMEN

OBJECTIVES: Heart failure (HF) impacts millions of patients worldwide, yet the variability in treatment responses remains a major challenge for healthcare professionals. The current treatment strategies, largely derived from population based evidence, often fail to consider the unique characteristics of individual patients, resulting in suboptimal outcomes. This study aims to develop computational models that are patient-specific in predicting treatment outcomes, by utilizing a large Electronic Health Records (EHR) database. The goal is to improve drug response predictions by identifying specific HF patient subgroups that are likely to benefit from existing HF medications. MATERIALS AND METHODS: A novel, graph-based model capable of predicting treatment responses, combining Graph Neural Network and Transformer was developed. This method differs from conventional approaches by transforming a patient's EHR data into a graph structure. By defining patient subgroups based on this representation via K-Means Clustering, we were able to enhance the performance of drug response predictions. RESULTS: Leveraging EHR data from 11 627 Mayo Clinic HF patients, our model significantly outperformed traditional models in predicting drug response using NT-proBNP as a HF biomarker across five medication categories (best RMSE of 0.0043). Four distinct patient subgroups were identified with differential characteristics and outcomes, demonstrating superior predictive capabilities over existing HF subtypes (best mean RMSE of 0.0032). DISCUSSION: These results highlight the power of graph-based modeling of EHR in improving HF treatment strategies. The stratification of patients sheds light on particular patient segments that could benefit more significantly from tailored response predictions. CONCLUSIONS: Longitudinal EHR data have the potential to enhance personalized prognostic predictions through the application of graph-based AI techniques.

2.
Int J Biol Macromol ; : 133477, 2024 Jun 26.
Artículo en Inglés | MEDLINE | ID: mdl-38942413

RESUMEN

The highly efficient removal of oils such as oils or dyes from wastewater has aroused wide concern and is of great significance for clean production and environmental remediation. The synthesis of a novel aerogel (designated as HEC/LS) is reported herein, achieved through a sol-gel method followed by freeze-drying utilizing loofa and hydroxyethyl cellulose as the raw materials. The new HEC/LS aerogel exhibits excellent porosity and specific surface area, with a porosity of 88.70 %, a total pore area of 0.607 m2 g-1, and a specific surface area of 230 m2 g-1. The prepared HEC/LS aerogel exhibits exceptional hydrophilicity and self-floatability, facilitating its rapid absorption of water up to 21 times its own weight within a mere 3 s. Additionally, it demonstrates good adsorption performance for methylene blue (MB), with a maximum adsorption capacity of 83.30 mg g-1. Subsequently, a new hydrophobic microorganisms-loaded composite aerogel (namely, Bn-HEC/LS) was obtained by doping microorganisms into the as-prepared HEC/LS in multiple enrichment followed by a hydrophobic and oleophilic surface modification. Based on its rich porous structure and oleophilic wettability, the as-synthesized Bn-HEC/LS exhibits excellent selective adsorption and degradation properties for the oil contamination, the diesel oil could be selectively absorbed in the Bn-HEC/LS and degraded by the loaded microorganisms. Among them, B5-HEC/LS displays the highest removal efficiency of 94.50 % within 180 h, while free microorganisms and HEC/LS aerogels show degradation efficiencies of only 21.70 % and 48.10 %, respectively. The fixation of microorganisms in the aerogel increases their number within the material and enhances the relative microorganisms removal capacity. The hydrophobic and lipophilic modifications improve the selective adsorption performance of the aerogel on diesel oil, resulting in a significantly high removal rate of Bn-HEC/LS for diesel oil. The results indicate that the immobilization of microorganisms into aerogel improves the activity of microorganisms, and the hydrophobic and oleophilic modification enhances the selective adsorption performance of aerogel to diesel oil, thus resulting in a very high removal rate of Bn-HEC/LS for diesel oil. This study is expected to provide a now possibility for the green and efficient bioremediation of oils.

3.
Molecules ; 29(12)2024 Jun 11.
Artículo en Inglés | MEDLINE | ID: mdl-38930832

RESUMEN

In this research, with an aim to develop novel pyrazole oxime ether derivatives possessing potential biological activity, thirty-two pyrazole oxime ethers, including a substituted pyridine ring, have been synthesized and structurally identified through 1H NMR, 13C NMR, and HRMS. Bioassay data indicated that most of these compounds owned strong insecticidal properties against Mythimna separata, Tetranychus cinnabarinus, Plutella xylostella, and Aphis medicaginis at a dosage of 500 µg/mL, and some title compounds were active towards Nilaparvata lugens at 500 µg/mL. Furthermore, some of the designed compounds had potent insecticidal effects against M. separata, T. cinnabarinus, or A. medicaginis at 100 µg/mL, with the mortalities of compounds 8a, 8c, 8d, 8e, 8f, 8g, 8o, 8s, 8v, 8x, and 8z against A. medicaginis, in particular, all reaching 100%. Even when the dosage was lowered to 20 µg/mL, compound 8s also expressed 50% insecticidal activity against M. separata, and compounds 8a, 8e, 8f, 8o, 8v, and 8x displayed more than 60% inhibition rates against A. medicaginis. The current results provided a significant basis for the rational design of biologically active pyrazole oxime ethers in future.


Asunto(s)
Diseño de Fármacos , Insecticidas , Oximas , Pirazoles , Pirazoles/química , Pirazoles/farmacología , Pirazoles/síntesis química , Oximas/química , Oximas/farmacología , Oximas/síntesis química , Insecticidas/química , Insecticidas/síntesis química , Insecticidas/farmacología , Animales , Relación Estructura-Actividad , Éteres/química , Estructura Molecular , Piridinas/química , Piridinas/farmacología , Piridinas/síntesis química , Mariposas Nocturnas/efectos de los fármacos
5.
AMIA Jt Summits Transl Sci Proc ; 2024: 593-602, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38827050

RESUMEN

Liver transplantation often faces fairness challenges across subgroups defined by sensitive attributes such as age group, gender, and race/ethnicity. Machine learning models for outcome prediction can introduce additional biases. Therefore, we introduce Fairness through the Equitable Rate of Improvement in Multitask Learning (FERI) algorithm for fair predictions of graft failure risk in liver transplant patients. FERI constrains subgroup loss by balancing learning rates and preventing subgroup dominance in the training process. Our results show that FERI maintained high predictive accuracy with AUROC and AUPRC comparable to baseline models. More importantly, FERI demonstrated an ability to improve fairness without sacrificing accuracy. Specifically, for the gender, FERI reduced the demographic parity disparity by 71.74%, and for the age group, it decreased the equalized odds disparity by 40.46%. Therefore, the FERI algorithm advanced fairness-aware predictive modeling in healthcare and provides an invaluable tool for equitable healthcare systems.

6.
HGG Adv ; 5(3): 100312, 2024 May 24.
Artículo en Inglés | MEDLINE | ID: mdl-38796699

RESUMEN

Orofacial clefts (OFCs) are among the most common human congenital birth defects. Previous multiethnic studies have identified dozens of associated loci for both cleft lip with or without cleft palate (CL/P) and cleft palate alone (CP). Although several nearby genes have been highlighted, the "casual" variants are largely unknown. Here, we developed DeepFace, a convolutional neural network model, to assess the functional impact of variants by SNP activity difference (SAD) scores. The DeepFace model is trained with 204 epigenomic assays from crucial human embryonic craniofacial developmental stages of post-conception week (pcw) 4 to pcw 10. The Pearson correlation coefficient between the predicted and actual values for 12 epigenetic features achieved a median range of 0.50-0.83. Specifically, our model revealed that SNPs significantly associated with OFCs tended to exhibit higher SAD scores across various variant categories compared to less related groups, indicating a context-specific impact of OFC-related SNPs. Notably, we identified six SNPs with a significant linear relationship to SAD scores throughout developmental progression, suggesting that these SNPs could play a temporal regulatory role. Furthermore, our cell-type specificity analysis pinpointed the trophoblast cell as having the highest enrichment of risk signals associated with OFCs. Overall, DeepFace can harness distal regulatory signals from extensive epigenomic assays, offering new perspectives for prioritizing OFC variants using contextualized functional genomic features. We expect DeepFace to be instrumental in accessing and predicting the regulatory roles of variants associated with OFCs, and the model can be extended to study other complex diseases or traits.

7.
medRxiv ; 2024 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-38746270

RESUMEN

Background: Synoptic reporting, the documenting of clinical information in a structured manner, is known to improve patient care by reducing errors, increasing readability, interoperability, and report completeness. Despite its advantages, manually synthesizing synoptic reports from narrative reports is expensive and error prone when the number of structured fields are many. While the recent revolutionary developments in Large Language Models (LLMs) have significantly advanced natural language processing, their potential for innovations in medicine is yet to be fully evaluated. Objectives: In this study, we explore the strengths and challenges of utilizing the state-of-the-art language models in the automatic synthesis of synoptic reports. Materials and Methods: We use a corpus of 7,774 cancer related, narrative pathology reports, which have annotated reference synoptic reports from Mayo Clinic EHR. Using these annotations as a reference, we reconfigure the state-of-the-art large language models, such as LLAMA-2, to generate the synoptic reports. Our annotated reference synoptic reports contain 22 unique data elements. To evaluate the accuracy of the reports generated by the LLMs, we use several metrics including the BERT F1 Score and verify our results by manual validation. Results: We show that using fine-tuned LLAMA-2 models, we can obtain BERT Score F1 of 0.86 or higher across all data elements and BERT F1 scores of 0.94 or higher on over 50% (11 of 22) of the questions. The BERT F1 scores translate to average accuracies of 76% and as high as 81% for short clinical reports. Conclusions: We demonstrate successful automatic synoptic report generation by fine-tuning large language models.

8.
PLOS Digit Health ; 3(5): e0000493, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38713647

RESUMEN

Randomized Clinical trials (RCT) suffer from a high failure rate which could be caused by heterogeneous responses to treatment. Despite many models being developed to estimate heterogeneous treatment effects (HTE), there remains a lack of interpretable methods to identify responsive subgroups. This work aims to develop a framework to identify subgroups based on treatment effects that prioritize model interpretability. The proposed framework leverages an ensemble uplift tree method to generate descriptive decision rules that separate samples given estimated responses to the treatment. Subsequently, we select a complementary set of these decision rules and rank them using a sparse linear model. To address the trial's limited sample size problem, we proposed a data augmentation strategy by borrowing control patients from external studies and generating synthetic data. We apply the proposed framework to a failed randomized clinical trial for investigating an intracerebral hemorrhage therapy plan. The Qini-scores show that the proposed data augmentation strategy plan can boost the model's performance and the framework achieves greater interpretability by selecting complementary descriptive rules without compromising estimation quality. Our model derives clinically meaningful subgroups. Specifically, we find those patients with Diastolic Blood Pressure≥70 mm hg and Systolic Blood Pressure<215 mm hg benefit more from intensive blood pressure reduction therapy. The proposed interpretable HTE analysis framework offers a promising potential for extracting meaningful insight from RCTs with neutral treatment effects. By identifying responsive subgroups, our framework can contribute to developing personalized treatment strategies for patients more efficiently.

9.
Res Sq ; 2024 Apr 30.
Artículo en Inglés | MEDLINE | ID: mdl-38746131

RESUMEN

Background: The potential benefits of drug combination synergy in cancer medicine are significant, yet the risks must be carefully managed due to the possibility of increased toxicity. Although artificial intelligence applications have demonstrated notable success in predicting drug combination synergy, several key challenges persist: (1) Existing models often predict average synergy values across a restricted range of testing dosages, neglecting crucial dose amounts and the mechanisms of action of the drugs involved. (2) Many graph-based models rely on static protein-protein interactions, failing to adapt to dynamic and context-dependent networks. This limitation constrains the applicability of current methods. Results: We introduced SAFER, a Sub-hypergraph Attention-based graph model, addressing these issues by incorporating complex relationships among biological knowledge networks and considering dosing effects on subject-specific networks. SAFER outperformed previous models on the benchmark and the independent test set. The analysis of subgraph attention weight for the lung cancer cell line highlighted JAK-STAT signaling pathway, PRDM12, ZNF781, and CDC5L that have been implicated in lung fibrosis. Conclusions: SAFER presents an interpretable framework designed to identify drug-responsive signals. Tailored for comprehending dose effects on subject-specific molecular contexts, our model uniquely captures dose-level drug combination responses. This capability unlocks previously inaccessible avenues of investigation compared to earlier models. Finally, the SAFER framework can be leveraged by future inquiries to investigate molecular networks that uniquely characterize individual patients.

10.
BMC Med Inform Decis Mak ; 24(1): 147, 2024 May 30.
Artículo en Inglés | MEDLINE | ID: mdl-38816848

RESUMEN

BACKGROUND: Securing adequate data privacy is critical for the productive utilization of data. De-identification, involving masking or replacing specific values in a dataset, could damage the dataset's utility. However, finding a reasonable balance between data privacy and utility is not straightforward. Nonetheless, few studies investigated how data de-identification efforts affect data analysis results. This study aimed to demonstrate the effect of different de-identification methods on a dataset's utility with a clinical analytic use case and assess the feasibility of finding a workable tradeoff between data privacy and utility. METHODS: Predictive modeling of emergency department length of stay was used as a data analysis use case. A logistic regression model was developed with 1155 patient cases extracted from a clinical data warehouse of an academic medical center located in Seoul, South Korea. Nineteen de-identified datasets were generated based on various de-identification configurations using ARX, an open-source software for anonymizing sensitive personal data. The variable distributions and prediction results were compared between the de-identified datasets and the original dataset. We examined the association between data privacy and utility to determine whether it is feasible to identify a viable tradeoff between the two. RESULTS: All 19 de-identification scenarios significantly decreased re-identification risk. Nevertheless, the de-identification processes resulted in record suppression and complete masking of variables used as predictors, thereby compromising dataset utility. A significant correlation was observed only between the re-identification reduction rates and the ARX utility scores. CONCLUSIONS: As the importance of health data analysis increases, so does the need for effective privacy protection methods. While existing guidelines provide a basis for de-identifying datasets, achieving a balance between high privacy and utility is a complex task that requires understanding the data's intended use and involving input from data users. This approach could help find a suitable compromise between data privacy and utility.


Asunto(s)
Confidencialidad , Anonimización de la Información , Humanos , Confidencialidad/normas , Servicio de Urgencia en Hospital , Tiempo de Internación , República de Corea , Masculino
11.
PLOS Digit Health ; 3(4): e0000479, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38598464

RESUMEN

The rate of progression of Alzheimer's disease (AD) differs dramatically between patients. Identifying the most is critical because when their numbers differ between treated and control groups, it distorts the outcome, making it impossible to tell whether the treatment was beneficial. Much recent effort, then, has gone into identifying RPs. We pooled de-identified placebo-arm data of three randomized controlled trials (RCTs), EXPEDITION, EXPEDITION 2, and EXPEDITION 3, provided by Eli Lilly and Company. After processing, the data included 1603 mild-to-moderate AD patients with 80 weeks of longitudinal observations on neurocognitive health, brain volumes, and amyloid-beta (Aß) levels. RPs were defined by changes in four neurocognitive/functional health measures. We built deep learning models using recurrent neural networks with attention mechanisms to predict RPs by week 80 based on varying observation periods from baseline (e.g., 12, 28 weeks). Feature importance scores for RP prediction were computed and temporal feature trajectories were compared between RPs and non-RPs. Our evaluation and analysis focused on models trained with 28 weeks of observation. The models achieved robust internal validation area under the receiver operating characteristic (AUROCs) ranging from 0.80 (95% CI 0.79-0.82) to 0.82 (0.81-0.83), and the area under the precision-recall curve (AUPRCs) from 0.34 (0.32-0.36) to 0.46 (0.44-0.49). External validation AUROCs ranged from 0.75 (0.70-0.81) to 0.83 (0.82-0.84) and AUPRCs from 0.27 (0.25-0.29) to 0.45 (0.43-0.48). Aß plasma levels, regional brain volumetry, and neurocognitive health emerged as important factors for the model prediction. In addition, the trajectories were stratified between predicted RPs and non-RPs based on factors such as ventricular volumes and neurocognitive domains. Our findings will greatly aid clinical trialists in designing tests for new medications, representing a key step toward identifying effective new AD therapies.

12.
Artículo en Inglés | MEDLINE | ID: mdl-38520725

RESUMEN

OBJECTIVES: The rapid expansion of biomedical literature necessitates automated techniques to discern relationships between biomedical concepts from extensive free text. Such techniques facilitate the development of detailed knowledge bases and highlight research deficiencies. The LitCoin Natural Language Processing (NLP) challenge, organized by the National Center for Advancing Translational Science, aims to evaluate such potential and provides a manually annotated corpus for methodology development and benchmarking. MATERIALS AND METHODS: For the named entity recognition (NER) task, we utilized ensemble learning to merge predictions from three domain-specific models, namely BioBERT, PubMedBERT, and BioM-ELECTRA, devised a rule-driven detection method for cell line and taxonomy names and annotated 70 more abstracts as additional corpus. We further finetuned the T0pp model, with 11 billion parameters, to boost the performance on relation extraction and leveraged entites' location information (eg, title, background) to enhance novelty prediction performance in relation extraction (RE). RESULTS: Our pioneering NLP system designed for this challenge secured first place in Phase I-NER and second place in Phase II-relation extraction and novelty prediction, outpacing over 200 teams. We tested OpenAI ChatGPT 3.5 and ChatGPT 4 in a Zero-Shot setting using the same test set, revealing that our finetuned model considerably surpasses these broad-spectrum large language models. DISCUSSION AND CONCLUSION: Our outcomes depict a robust NLP system excelling in NER and RE across various biomedical entities, emphasizing that task-specific models remain superior to generic large ones. Such insights are valuable for endeavors like knowledge graph development and hypothesis formulation in biomedical research.

13.
Pain Physician ; 27(3): E305-E316, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38506677

RESUMEN

BACKGROUND: Lumbar disc herniation is a common spinal disease that causes low back pain; surgery is required when conservative treatment is ineffective. There is a growing demand for minimally invasive surgery in younger patient populations due to their fear of significant damage and a long recovery period following standard open discectomy. The development history of minimally invasive surgery is relatively short, and no gold standard has been established. OBJECTIVES: We aimed to find, via a network meta-analysis, the best treatment for low back pain in younger patient populations. STUDY DESIGN: Network meta-analysis. METHODS: The PubMed, Embase, Cochrane Library, and Web of Science databases were searched. Data quality was evaluated using RevMan 5.3 (The Nordic Cochrane Centre for The Cochrane Collaboration), while STATA 14.0 (StataCorp LLC) was used for the network meta-analysis and to merge data on the Visual Analog Scale (VAS) score, Oswestry Disability Index (ODI) score, complication, blood loss, reoperation rate, and function score. RESULTS: We included 50 randomized controlled trials, involving 7 interventions; heterogeneity and inconsistency were acceptable. Comparatively, microendoscopic discectomy and percutaneous endoscopic lumbar discectomy were the best surgical procedures from the aspects of VAS score and ODI score, while standard open discectomy was the worst one from the aspect of ODI score. Regarding complications, tubular discectomy was preferred with the fewest complications. Additionally, microendoscopic discectomy outperformed other surgical procedures in reducing blood loss and reoperation rate. LIMITATIONS: First, follow-up data were not reported in all included studies, and the follow-up time varied from several months to 8 years, which affected the results accuracy of our study to some extent. Second, there were some nonsurgical factors that also affected the self-reported outcomes, such as rehabilitation and pain management, which also brought a certain bias in our study results. CONCLUSIONS: Compared to standard open discectomy, minimally invasive surgical procedures not only achieve satisfactory efficacy, but also microendoscopic discectomy and percutaneous endoscopic lumbar discectomy can obtain a more satisfactory short-term VAS score and ODI score. Microendoscopic discectomy has significant advantages in blood loss and reoperation rate, and tubular discectomy has fewer postoperative complications.


Asunto(s)
Discectomía Percutánea , Desplazamiento del Disco Intervertebral , Dolor de la Región Lumbar , Humanos , Desplazamiento del Disco Intervertebral/cirugía , Metaanálisis en Red , Vértebras Lumbares/cirugía , Procedimientos Quirúrgicos Mínimamente Invasivos , Discectomía
14.
J Dent ; 144: 104921, 2024 05.
Artículo en Inglés | MEDLINE | ID: mdl-38437976

RESUMEN

OBJECTIVES: This study aimed to identify predictors associated with the tooth loss phenotype in a large periodontitis patient cohort in the university setting. METHODS: Information on periodontitis patients and nineteen factors identified at the initial visit was extracted from electronic health records. The primary outcome is tooth loss phenotype (presence or absence of tooth loss). Prediction models were built on significant factors (single or combinatory) selected by the RuleFit algorithm, and these factors were further adopted by regression models. Model performance was evaluated by Area Under the Receiver Operating Characteristic Curve (AUROC) and Area Under the Precision-Recall Curve (AUPRC). Associations between predictors and the tooth loss phenotype were also evaluated by classical statistical approaches to validate the performance of machine learning models. RESULTS: In total, 7840 patients were included. The machine learning model predicting the tooth loss phenotype achieved AUROC of 0.71 and AUPRC of 0.66. Age, periodontal diagnosis, number of missing teeth at baseline, furcation involvement, and tooth mobility were associated with the tooth loss phenotype in both machine learning and classical statistical models. CONCLUSIONS: The rule-based machine learning approach improves model explainability compared to classical statistical methods. However, the model's generalizability needs to be further validated by external datasets. CLINICAL SIGNIFICANCE: Predictors identified by the current machine learning approach using the RuleFit algorithm had clinically relevant thresholds in predicting the tooth loss phenotype in a large and diverse periodontitis patient cohort. The results of this study will assist clinicians in performing risk assessment for periodontitis at the initial visit.


Asunto(s)
Aprendizaje Automático , Periodontitis , Fenotipo , Pérdida de Diente , Humanos , Masculino , Femenino , Periodontitis/complicaciones , Persona de Mediana Edad , Adulto , Curva ROC , Movilidad Dentaria , Factores de Riesgo , Algoritmos , Registros Electrónicos de Salud , Estudios de Cohortes , Área Bajo la Curva , Defectos de Furcación , Anciano
15.
J Biomed Inform ; 151: 104606, 2024 03.
Artículo en Inglés | MEDLINE | ID: mdl-38325698

RESUMEN

Electronic health records (EHRs) store an extensive array of patient information, encompassing medical histories, diagnoses, treatments, and test outcomes. These records are crucial for enabling healthcare providers to make well-informed decisions regarding patient care. Summarizing clinical notes further assists healthcare professionals in pinpointing potential health risks and making better-informed decisions. This process contributes to reducing errors and enhancing patient outcomes by ensuring providers have access to the most pertinent and current patient data. Recent research has shown that incorporating instruction prompts with large language models (LLMs) substantially boosts the efficacy of summarization tasks. However, we show that this approach also leads to increased performance variance, resulting in significantly distinct summaries even when instruction prompts share similar meanings. To tackle this challenge, we introduce a model-agnostic Soft Prompt-BasedCalibration (SPeC) pipeline that employs soft prompts to lower variance while preserving the advantages of prompt-based summarization. Experimental findings on multiple clinical note tasks and LLMs indicate that our method not only bolsters performance but also effectively regulates variance across different LLMs, providing a more consistent and reliable approach to summarizing critical medical information.


Asunto(s)
Registros Electrónicos de Salud , Procesamiento de Lenguaje Natural , Humanos , Calibración , Lenguaje , Personal de Salud
16.
J Alzheimers Dis ; 97(4): 1807-1827, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38306043

RESUMEN

Background: The progressive cognitive decline, an integral component of Alzheimer's disease (AD), unfolds in tandem with the natural aging process. Neuroimaging features have demonstrated the capacity to distinguish cognitive decline changes stemming from typical brain aging and AD between different chronological points. Objective: To disentangle the normal aging effect from the AD-related accelerated cognitive decline and unravel its genetic components using a neuroimaging-based deep learning approach. Methods: We developed a deep-learning framework based on a dual-loss Siamese ResNet network to extract fine-grained information from the longitudinal structural magnetic resonance imaging (MRI) data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) study. We then conducted genome-wide association studies (GWAS) and post-GWAS analyses to reveal the genetic basis of AD-related accelerated cognitive decline. Results: We used our model to process data from 1,313 individuals, training it on 414 cognitively normal people and predicting cognitive assessment for all participants. In our analysis of accelerated cognitive decline GWAS, we identified two genome-wide significant loci: APOE locus (chromosome 19 p13.32) and rs144614292 (chromosome 11 p15.1). Variant rs144614292 (G > T) has not been reported in previous AD GWA studies. It is within the intronic region of NELL1, which is expressed in neurons and plays a role in controlling cell growth and differentiation. The cell-type-specific enrichment analysis and functional enrichment of GWAS signals highlighted the microglia and immune-response pathways. Conclusions: Our deep learning model effectively extracted relevant neuroimaging features and predicted individual cognitive decline. We reported a novel variant (rs144614292) within the NELL1 gene.


Asunto(s)
Enfermedad de Alzheimer , Disfunción Cognitiva , Aprendizaje Profundo , Humanos , Enfermedad de Alzheimer/diagnóstico por imagen , Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/patología , Estudio de Asociación del Genoma Completo , Neuroimagen/métodos , Imagen por Resonancia Magnética/métodos , Encéfalo/patología , Disfunción Cognitiva/diagnóstico por imagen , Disfunción Cognitiva/genética , Disfunción Cognitiva/patología
17.
NPJ Digit Med ; 7(1): 40, 2024 Feb 19.
Artículo en Inglés | MEDLINE | ID: mdl-38374445

RESUMEN

Large language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology and medicine has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structured data and sample size are limited, by extracting prior knowledge from text corpora. Here we report our proposed few-shot learning approach, which uses LLMs to predict the synergy of drug pairs in rare tissues that lack structured data and features. Our experiments, which involved seven rare tissues from different cancer types, demonstrate that the LLM-based prediction model achieves significant accuracy with very few or zero samples. Our proposed model, the CancerGPT (with ~ 124M parameters), is comparable to the larger fine-tuned GPT-3 model (with ~ 175B parameters). Our research contributes to tackling drug pair synergy prediction in rare tissues with limited data, and also advancing the use of LLMs for biological and medical inference tasks.

18.
Artículo en Inglés | MEDLINE | ID: mdl-38281369

RESUMEN

Imatinib is the tyrosine kinase inhibitor of choice for the treatment of chronic myeloid leukemia and gastrointestinal stromal tumors. However, imatinib has drawbacks such as drug resistance and significant differences in pharmacokinetics within patients. Therefore, a colloidal gold-based immunochromatographic assay (CG-IA) was developed for measuring and monitoring imatinib in human serum. An imatinib derivative containing carboxyl groups was used for the synthesis of the immunogen, and 4-(4-methyl-1-piperazinylmethyl) benzoic acid was selected as the hapten for the heterologous coating antigen. Next, a highly sensitive and specific monoclonal antibody (mAb), 2F7 was screened for the construction of a CG-IA, with an IC50 value of 0.091 ng/mL. For the qualification of imatinib in human serum, the visual limit of detection (vLOD) and cut-off values of the CG-IA were 2 and 20 ng/mL, respectively. For quantitative detection, the calculated LOD value of the CG-IA was 0.068 ng/mL, with a linearity range of 1.004 and 23.087 ng/mL. The recovery rate of spiked serum samples was between 88.24 % and 104.75 %. In addition, the concentration of imatinib in the serum samples from 10 patients was detected by CG-IA and revealed a good correlation with those from LC-MS/MS. These results indicated that the developed gold-based paper sensor could become an effective tool for the rapid monitoring of imatinib in human serum samples.


Asunto(s)
Inhibidores de Proteínas Quinasas , Espectrometría de Masas en Tándem , Humanos , Mesilato de Imatinib , Cromatografía Liquida , Inmunoensayo/métodos , Oro Coloide/química , Límite de Detección , Cromatografía de Afinidad/métodos
19.
J Clin Periodontol ; 51(5): 547-557, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38212876

RESUMEN

AIM: To develop and validate an automated electronic health record (EHR)-based algorithm to suggest a periodontal diagnosis based on the 2017 World Workshop on the Classification of Periodontal Diseases and Conditions. MATERIALS AND METHODS: Using material published from the 2017 World Workshop, a tool was iteratively developed to suggest a periodontal diagnosis based on clinical data within the EHR. Pertinent clinical data included clinical attachment level (CAL), gingival margin to cemento-enamel junction distance, probing depth, furcation involvement (if present) and mobility. Chart reviews were conducted to confirm the algorithm's ability to accurately extract clinical data from the EHR, and then to test its ability to suggest an accurate diagnosis. Subsequently, refinements were made to address limitations of the data and specific clinical situations. Each refinement was evaluated through chart reviews by expert periodontists at the study sites. RESULTS: Three-hundred and twenty-three charts were manually reviewed, and a periodontal diagnosis (healthy, gingivitis or periodontitis including stage and grade) was made by expert periodontists for each case. After developing the initial version of the algorithm using the unmodified 2017 World Workshop criteria, accuracy was 71.8% for stage alone and 64.7% for stage and grade. Subsequently, 16 modifications to the algorithm were proposed and 14 were accepted. This refined version of the algorithm had 79.6% accuracy for stage alone and 68.8% for stage and grade together. CONCLUSIONS: Our findings suggest that a rule-based algorithm for suggesting a periodontal diagnosis using EHR data can be implemented with moderate accuracy in support of chairside clinical diagnostic decision making, especially for inexperienced clinicians. Grey-zone cases still exist, where clinical judgement will be required. Future applications of similar algorithms with improved performance will depend upon the quality (completeness/accuracy) of EHR data.


Asunto(s)
Gingivitis , Enfermedades Periodontales , Periodontitis , Humanos , Registros Electrónicos de Salud , Enfermedades Periodontales/diagnóstico , Algoritmos
20.
Artículo en Inglés | MEDLINE | ID: mdl-38281112

RESUMEN

IMPORTANCE: The study highlights the potential of large language models, specifically GPT-3.5 and GPT-4, in processing complex clinical data and extracting meaningful information with minimal training data. By developing and refining prompt-based strategies, we can significantly enhance the models' performance, making them viable tools for clinical NER tasks and possibly reducing the reliance on extensive annotated datasets. OBJECTIVES: This study quantifies the capabilities of GPT-3.5 and GPT-4 for clinical named entity recognition (NER) tasks and proposes task-specific prompts to improve their performance. MATERIALS AND METHODS: We evaluated these models on 2 clinical NER tasks: (1) to extract medical problems, treatments, and tests from clinical notes in the MTSamples corpus, following the 2010 i2b2 concept extraction shared task, and (2) to identify nervous system disorder-related adverse events from safety reports in the vaccine adverse event reporting system (VAERS). To improve the GPT models' performance, we developed a clinical task-specific prompt framework that includes (1) baseline prompts with task description and format specification, (2) annotation guideline-based prompts, (3) error analysis-based instructions, and (4) annotated samples for few-shot learning. We assessed each prompt's effectiveness and compared the models to BioClinicalBERT. RESULTS: Using baseline prompts, GPT-3.5 and GPT-4 achieved relaxed F1 scores of 0.634, 0.804 for MTSamples and 0.301, 0.593 for VAERS. Additional prompt components consistently improved model performance. When all 4 components were used, GPT-3.5 and GPT-4 achieved relaxed F1 socres of 0.794, 0.861 for MTSamples and 0.676, 0.736 for VAERS, demonstrating the effectiveness of our prompt framework. Although these results trail BioClinicalBERT (F1 of 0.901 for the MTSamples dataset and 0.802 for the VAERS), it is very promising considering few training samples are needed. DISCUSSION: The study's findings suggest a promising direction in leveraging LLMs for clinical NER tasks. However, while the performance of GPT models improved with task-specific prompts, there's a need for further development and refinement. LLMs like GPT-4 show potential in achieving close performance to state-of-the-art models like BioClinicalBERT, but they still require careful prompt engineering and understanding of task-specific knowledge. The study also underscores the importance of evaluation schemas that accurately reflect the capabilities and performance of LLMs in clinical settings. CONCLUSION: While direct application of GPT models to clinical NER tasks falls short of optimal performance, our task-specific prompt framework, incorporating medical knowledge and training samples, significantly enhances GPT models' feasibility for potential clinical applications.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...