Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 37
Filter
1.
Water Res ; 258: 121753, 2024 May 08.
Article in English | MEDLINE | ID: mdl-38754298

ABSTRACT

Seawater utilization is crucial for the sustainable human development. Despite growing interest in forward osmosis (FO) due to its unique properties, conventional FO membranes with salt-water selectivity have limitations in applying to specific salt-salt separation processes, which hinders their application in resource utilization. In this work, a new concept, "selective forward osmosis (SFO)", was proposed, which ingeniously employed an SFO membrane consisting of an ion-selective layer on a denser substrate. The denser substrate is designed to control water flux so as to alleviate the solution dilution and improve the salt-salt separation. Moreover, the sucrose and pure water were used separately as feed solution to provide different water flux to influence the various salt fluxes, showing that pure water feed could enhance the salt-salt separation efficiency, although it could dilute the draw solution to some extent. Therefore, pure water was selected as feed in the subsequent experiments. The optimized SFO membrane achieved high Na2SO4/NaCl selectivity (∼54.8) and MgCl2/NaCl selectivity (∼9.2) in single-salt draw solutions. With mixed-salt and heavy-metal-mixed-salt draw solutions, the Mg2+/Na+ selectivity was enhanced to ∼14.5, and further to 29.3. In real seawater tests, the SFO system effectively permeated monovalent elements (such as Na flux of ∼68.6 g m-2 h-1) while maintaining a higher rejection for bivalent elements (such as Mg flux of ∼0.08 g m-2 h-1), showing high selectivities for Mg/Na, U/Na, Sr/Na, Ni/Na, and Ca/Na. These results demonstrate the potential of SFO for resource utilization, especially in complex saline environments. This work contributes a new route for salt-salt separation in the pretreatment of seawater resources.

2.
J Dairy Sci ; 107(4): 1928-1949, 2024 Apr.
Article in English | MEDLINE | ID: mdl-37939838

ABSTRACT

This study evaluated 75 strains of lactic acid bacteria (LAB) isolated from traditional dairy products in western China for their probiotic properties. Among them, Limosilactobacillus fermentum WXZ 2-1, Lactiplantibacillus plantarum TXZ 2-35, Companilactobacillus crustorum QHS 9, and Companilactobacillus crustorum QHS 10 demonstrated potential probiotic characteristics. The antioxidant capacity of these 4 strains was assessed, revealing that L. fermentum WXZ 2-1 exhibited the highest antioxidant capacity. Furthermore, when cocultured with Streptococcus salivarius ssp. thermophilus and Lactobacillus delbrueckii ssp. bulgaricus, L. fermentum WXZ 2-1 demonstrated a synergistic effect in growth medium and goat milk. To explore its effect on goat milk fermentation, different amounts of L. fermentum WXZ 2-1 were added to goat milk, and its physicochemical properties, antioxidant activity, flavor substances, and metabolomics were analyzed. The study found that the incorporation of L. fermentum WXZ 2-1 in goat milk fermentation significantly improved the texture characteristics, antioxidant capacity, and flavor of fermented goat milk. These findings highlight the potential of L. fermentum WXZ 2-1 as a valuable probiotic strain for enhancing the functionality and desirability of fermented goat milk, contributing to the development of novel functional foods with improved health benefits and enhanced quality attributes.


Subject(s)
Lactobacillus delbrueckii , Lactobacillus plantarum , Limosilactobacillus fermentum , Probiotics , Animals , Milk/chemistry , Antioxidants/metabolism , Lactobacillus plantarum/metabolism , Lactobacillus delbrueckii/metabolism , Goats/metabolism , Fermentation , Probiotics/metabolism
3.
Nat Commun ; 14(1): 8180, 2023 Dec 11.
Article in English | MEDLINE | ID: mdl-38081829

ABSTRACT

Target trial emulation is the process of mimicking target randomized trials using real-world data, where effective confounding control for unbiased treatment effect estimation remains a main challenge. Although various approaches have been proposed for this challenge, a systematic evaluation is still lacking. Here we emulated trials for thousands of medications from two large-scale real-world data warehouses, covering over 10 years of clinical records for over 170 million patients, aiming to identify new indications of approved drugs for Alzheimer's disease. We assessed different propensity score models under the inverse probability of treatment weighting framework and suggested a model selection strategy for improved baseline covariate balancing. We also found that the deep learning-based propensity score model did not necessarily outperform logistic regression-based methods in covariate balancing. Finally, we highlighted five top-ranked drugs (pantoprazole, gabapentin, atorvastatin, fluticasone, and omeprazole) originally intended for other indications with potential benefits for Alzheimer's patients.


Subject(s)
Alzheimer Disease , Humans , Alzheimer Disease/drug therapy , Drug Repositioning , Propensity Score , Atorvastatin/therapeutic use
4.
Sci Rep ; 13(1): 613, 2023 01 12.
Article in English | MEDLINE | ID: mdl-36635438

ABSTRACT

Overly restrictive eligibility criteria for clinical trials may limit the generalizability of the trial results to their target real-world patient populations. We developed a novel machine learning approach using large collections of real-world data (RWD) to better inform clinical trial eligibility criteria design. We extracted patients' clinical events from electronic health records (EHRs), which include demographics, diagnoses, and drugs, and assumed certain compositions of these clinical events within an individual's EHRs can determine the subphenotypes-homogeneous clusters of patients, where patients within each subgroup share similar clinical characteristics. We introduced an outcome-guided probabilistic model to identify those subphenotypes, such that the patients within the same subgroup not only share similar clinical characteristics but also at similar risk levels of encountering severe adverse events (SAEs). We evaluated our algorithm on two previously conducted clinical trials with EHRs from the OneFlorida+ Clinical Research Consortium. Our model can clearly identify the patient subgroups who are more likely to suffer or not suffer from SAEs as subphenotypes in a transparent and interpretable way. Our approach identified a set of clinical topics and derived novel patient representations based on them. Each clinical topic represents a certain clinical event composition pattern learned from the patient EHRs. Tested on both trials, patient subgroup (#SAE=0) and patient subgroup (#SAE>0) can be well-separated by k-means clustering using the inferred topics. The inferred topics characterized as likely to align with the patient subgroup (#SAE>0) revealed meaningful combinations of clinical features and can provide data-driven recommendations for refining the exclusion criteria of clinical trials. The proposed supervised topic modeling approach can infer the clinical topics from the subphenotypes with or without SAEs. The potential rules for describing the patient subgroups with SAEs can be further derived to inform the design of clinical trial eligibility criteria.


Subject(s)
Electronic Health Records , Machine Learning , Humans , Algorithms , Eligibility Determination , Models, Statistical , Clinical Trials as Topic
5.
Int J Med Inform ; 170: 104973, 2023 02.
Article in English | MEDLINE | ID: mdl-36577203

ABSTRACT

BACKGROUND: Cognitive tests and biomarkers are the key information to assess the severity and track the progression of Alzheimer's' disease (AD) and AD-related dementias (AD/ADRD), yet, both are often only documented in clinical narratives of patients' electronic health records (EHRs). In this work, we aim to (1) assess the documentation of cognitive tests and biomarkers in EHRs that can be used as real-world endpoints, and (2) identify, extract, and harmonize the different commonly used cognitive tests from clinical narratives using natural language processing (NLP) methods into categorical AD/ADRD severity. METHODS: We developed a rule-based NLP pipeline to extract the cognitive tests and biomarkers from clinical narratives in AD/ADRD patients' EHRs. We aggregated the extracted results to the patient level and harmonized the cognitive test scores into severity categories using cutoffs determined based on both relevant literature and domain knowledge of AD/ADRD clinicians. RESULTS: We identified an AD/ADRD cohort of 48,912 patients from the University of Florida (UF) Health system and identified 7 measurements (6 cognitive tests and 1 biomarker) that are frequently documented in our data. Our NLP pipeline achieved an overall F1-score of 0.9059 across the 7 measurements. Among the 6 cognitive tests, we were able to harmonize 4 cognitive test scores into severity categories, and the population characteristics of patients with different severity were described. We also identified several factors related to the availability of their documentation in EHRs. CONCLUSION: This study demonstrates that our NLP pipelines can extract cognitive tests and biomarkers of AD/ADRD accurately for downstream studies. Although, the documentation of cognitive tests and biomarkers in EHRs appears to be low, RWD is still an important resource for AD/ADRD research. Nevertheless, providing standardized approach to document cognitive tests and biomarkers in EHRS are also warranted.


Subject(s)
Alzheimer Disease , Humans , Alzheimer Disease/diagnosis , Natural Language Processing , Electronic Health Records , Biomarkers , Documentation
6.
FEBS Open Bio ; 12(10): 1828-1838, 2022 10.
Article in English | MEDLINE | ID: mdl-36062491

ABSTRACT

Adipose tissue is a major component for the regulation of energy homeostasis by storage and release of lipids. As a core element of RNA-induced silencing complex, argonaute2 (Ago2) plays critical role in maintenance of systemic metabolic demand. Here, we show that high-fat-diet-fed mice exhibit an increase in body mass alongside systematic insulin resistance and altered rate of energy expenditure. Interestingly, Ago2 expression is associated with obesity and an increased amount of adipose tissue. Moreover, increased levels of Ago2 inhibited the expression of AMPKα by promoting its targeting by miR-148a, the most abundant microRNA in adipose tissues. Those results suggested that Ago2-miR-148a-AMPKα signaling pathway play an important function in the developing obesity and adiposity, and will further provide basic research data for the potential clinical treatment of obesity.


Subject(s)
MicroRNAs , AMP-Activated Protein Kinases/metabolism , Adipose Tissue/metabolism , Animals , Argonaute Proteins , Lipids , Mice , MicroRNAs/genetics , MicroRNAs/metabolism , Obesity/genetics , Obesity/metabolism , Signal Transduction
7.
JCO Clin Cancer Inform ; 6: e2100195, 2022 07.
Article in English | MEDLINE | ID: mdl-35839432

ABSTRACT

PURPOSE: Using real-world data (RWD)-based trial simulation approach, we aim to simulate colorectal cancer (CRC) trials and examine both effectiveness and safety end points in different simulation scenarios. METHODS: We identified five phase III trials comparing new treatment regimens with an US Food and Drug Administration-approved first-line treatment in patients with metastatic CRC (ie, fluorouracil, leucovorin, and irinotecan) as the standard-of-care (SOC) control arm. Using Electronic Health Record-derived data from the OneFlorida network, we defined the study populations and outcome measures using the protocols from the original trials. Our design scenarios were (1) simulation of the SOC fluorouracil, leucovorin, and irinotecan arm and (2) comparative effectiveness research (CER) simulation of the control and experimental arms. For each scenario, we adjusted for random assignment, sampling, and dropout. We used overall survival (OS) and severe adverse events (SAEs) to measure effectiveness and safety. RESULTS: We conducted CER simulations for two trials, and SOC simulations for three trials. The effect sizes of our simulated trials were stable across all simulation runs. Compared with the original trials, we observed longer OS and higher mean number of SAEs in both CER and SOC simulation. In the two CER simulations, hazard ratios associated with death from simulations were similar to that reported in the original trials. Consistent with the original trials, we found higher risk ratios of SAEs in the experiment arm, suggesting potentially higher toxicities from the new treatment regimen. We also observed similar SAE rates across all simulations compared with the original trials. CONCLUSION: In this study, we simulated five CRC trials, and tested two simulation scenarios with several different configurations demonstrated that our simulations can robustly generate effectiveness and safety outcomes comparable with the original trials using real-world data.


Subject(s)
Antineoplastic Combined Chemotherapy Protocols , Colorectal Neoplasms , Antineoplastic Combined Chemotherapy Protocols/adverse effects , Clinical Trials, Phase III as Topic , Colorectal Neoplasms/drug therapy , Fluorouracil/therapeutic use , Humans , Irinotecan/therapeutic use , Leucovorin/therapeutic use
8.
Int J Med Inform ; 165: 104834, 2022 09.
Article in English | MEDLINE | ID: mdl-35863206

ABSTRACT

OBJECTIVE: We summarized a decade of new research focusing on semantic data integration (SDI) since 2009, and we aim to: (1) summarize the state-of-art approaches on integrating health data and information; and (2) identify the main gaps and challenges of integrating health data and information from multiple levels and domains. MATERIALS AND METHODS: We used PubMed as our focus is applications of SDI in biomedical domains and followed the Preferred Reporting Items for Systematic Review and Meta-Analyses (PRISMA) to search and report for relevant studies published between January 1, 2009 and December 31, 2021. We used Covidence-a systematic review management system-to carry out this scoping review. RESULTS: The initial search from PubMed resulted in 5,326 articles using the two sets of keywords. We then removed 44 duplicates and 5,282 articles were retained for abstract screening. After abstract screening, we included 246 articles for full-text screening, among which 87 articles were deemed eligible for full-text extraction. We summarized the 87 articles from four aspects: (1) methods for the global schema; (2) data integration strategies (i.e., federated system vs. data warehousing); (3) the sources of the data; and (4) downstream applications. CONCLUSION: SDI approach can effectively resolve the semantic heterogeneities across different data sources. We identified two key gaps and challenges in existing SDI studies that (1) many of the existing SDI studies used data from only single-level data sources (e.g., integrating individual-level patient records from different hospital systems), and (2) documentation of the data integration processes is sparse, threatening the reproducibility of SDI studies.


Subject(s)
Information Storage and Retrieval , Semantics , Humans , Mass Screening , Reproducibility of Results
9.
Molecules ; 26(23)2021 Nov 26.
Article in English | MEDLINE | ID: mdl-34885759

ABSTRACT

Osteoarthritis is a common multifactorial chronic disease that occurs in articular cartilage, subchondral bone, and periarticular tissue. The pathogenesis of OA is still unclear. To investigate the differences in serum metabolites between OA and the control group, liquid chromatography/mass spectrometry (LC/MS)-based metabolomics was used. To reveal the pathogenesis of OA, 12 SD male rats were randomly divided into control and OA groups using collagenase to induce OA for modeling, and serum was collected 7 days after modeling for testing. The OA group was distinguished from the control group by principal component analysis and orthogonal partial least squares-discriminant analysis, and six biomarkers were finally identified. These biomarkers were metabolized through tryptophan metabolism, glutamate metabolism, nitrogen metabolism, spermidine metabolism, and fatty acid metabolism pathways. The study identified metabolites that may be altered in OA, suggesting a role in OA through relevant metabolic pathways. Metabolomics, as an important tool for studying disease mechanisms, provides useful information for studying the metabolic mechanisms of OA.


Subject(s)
Biomarkers/blood , Cartilage, Articular/metabolism , Metabolomics , Osteoarthritis/blood , Animals , Cartilage, Articular/drug effects , Cartilage, Articular/pathology , Chromatography, Liquid , Collagenases/toxicity , Disease Models, Animal , Fatty Acids/blood , Glutamic Acid/blood , Humans , Mass Spectrometry , Metabolic Networks and Pathways , Metabolome/genetics , Nitrogen/blood , Osteoarthritis/chemically induced , Osteoarthritis/genetics , Osteoarthritis/metabolism , Rats , Spermidine/blood , Tryptophan/blood
10.
Front Pharmacol ; 12: 720866, 2021.
Article in English | MEDLINE | ID: mdl-34630099

ABSTRACT

Pancreatic ß-cell dysfunction is a key link during the progression of type 2 diabetes (T2DM), and SIRT1 participates in the regulation of various physiological activities of islet ß-cells. However, as a key link in signal transduction, it is not clear how SIRT1 is regulated. By TargetScan prediction, we found that miR-204, which is enriched in islets, has highly complementary binding sites with SIRT1. Therefore, we speculate that miR-204 may be the upstream regulatory target of SIRT1 in islets and thus participate in the occurrence of ß-cell dysfunction. In this study, we explored the association between miR-204 and ß-cell dysfunction, the therapeutic effects of berberine (BBR) on ß-cell function and the possible mechanisms. We found that miR-204 increased and SIRT1 mRNA and protein levels decreased significantly in islets both in vivo and in vitro. MIN6 cells induced by palmitic acid exhibited increased apoptosis, and the accumulation of insulin and ATP in the supernatant decreased. Importantly, palmitic acid treatment combined with miR-204 silencing showed opposite changes. MiR-204 overexpression in MIN6 cells increased apoptosis and decreased insulin and ATP production and SIRT1 expression. SIRT1 overexpression reversed the damage to ß-cells caused by miR-204. The BBR treatment effectively improved insulin synthesis, reduced miR-204 levels, and increased SIRT1 expression in islet tissue in diabetic mice. Overexpression of miR-204 reversed the protective effect of BBR on apoptosis and insulin secretion in MIN6 cells. Our study identifies a novel correlation between miR-204 and ß-cell dysfunction in T2DM and shows that administration of BBR leads to remission of ß-cell dysfunction by regulating the miR-204/SIRT1 pathway.

11.
Molecules ; 26(18)2021 Sep 20.
Article in English | MEDLINE | ID: mdl-34577169

ABSTRACT

Artemisinin (also known as Qinghaosu), an active component of the Qinghao extract, is widely used as antimalarial drug. Previous studies reveal that artemisinin and its derivatives also have effective anti-inflammatory and immunomodulatory properties, but the direct molecular target remains unknown. Recently, several reports mentioned that myeloid differentiation factor 2 (MD-2, also known as lymphocyte antigen 96) may be the endogenous target of artemisinin in the inhibition of lipopolysaccharide signaling. However, the exact interaction between artemisinin and MD-2 is still not fully understood. Here, experimental and computational methods were employed to elucidate the relationship between the artemisinin and its inhibition mechanism. Experimental results showed that artemether exhibit higher anti-inflammatory activity performance than artemisinin and artesunate. Molecular docking results showed that artemisinin, artesunate, and artemether had similar binding poses, and all complexes remained stable throughout the whole molecular dynamics simulations, whereas the binding of artemisinin and its derivatives to MD-2 decreased the TLR4(Toll-Like Receptor 4)/MD-2 stability. Moreover, artemether exhibited lower binding energy as compared to artemisinin and artesunate, which is in good agreement with the experimental results. Leu61, Leu78, and Ile117 are indeed key residues that contribute to the binding free energy. Binding free energy analysis further confirmed that hydrophobic interactions were critical to maintain the binding mode of artemisinin and its derivatives with MD-2.


Subject(s)
Anti-Inflammatory Agents/chemistry , Anti-Inflammatory Agents/pharmacology , Artemisinins/chemistry , Artemisinins/pharmacology , Lymphocyte Antigen 96/antagonists & inhibitors , Lymphocyte Antigen 96/chemistry , Animals , Artemether/pharmacology , Artesunate/pharmacology , Binding Sites/drug effects , Cell Line , Cell Survival/drug effects , Fatty Acid-Binding Proteins/metabolism , Hydrophobic and Hydrophilic Interactions , Immunomodulation/drug effects , In Vitro Techniques , Lipopolysaccharides/toxicity , Mice , Microglia/drug effects , Molecular Docking Simulation , Molecular Dynamics Simulation , Nitric Oxide/metabolism , Protein Binding , Thermodynamics , Toll-Like Receptor 4/antagonists & inhibitors , Toll-Like Receptor 4/chemistry , Toll-Like Receptor 4/metabolism , Tumor Necrosis Factor-alpha/metabolism
12.
NPJ Digit Med ; 4(1): 84, 2021 May 14.
Article in English | MEDLINE | ID: mdl-33990663

ABSTRACT

In this study, we explored the feasibility of using real-world data (RWD) from a large clinical research network to simulate real-world clinical trials of Alzheimer's disease (AD). The target trial (i.e., NCT00478205) is a Phase III double-blind, parallel-group trial that compared the 23 mg donepezil sustained release with the 10 mg donepezil immediate release formulation in patients with moderate to severe AD. We followed the target trial's study protocol to identify the study population, treatment regimen assignments and outcome assessments, and to set up a number of different simulation scenarios and parameters. We considered two main scenarios: (1) a one-arm simulation: simulating a standard-of-care (SOC) arm that can serve as an external control arm; and (2) a two-arm simulation: simulating both intervention and control arms with proper patient matching algorithms for comparative effectiveness analysis. In the two-arm simulation scenario, we used propensity score matching controlling for baseline characteristics to simulate the randomization process. In the two-arm simulation, higher serious adverse event (SAE) rates were observed in the simulated trials than the rates reported in original trial, and a higher SAE rate was observed in the 23 mg arm than in the 10 mg SOC arm. In the one-arm simulation scenario, similar estimates of SAE rates were observed when proportional sampling was used to control demographic variables. In conclusion, trial simulation using RWD is feasible in this example of AD trial in terms of safety evaluation. Trial simulation using RWD could be a valuable tool for post-market comparative effectiveness studies and for informing future trials' design. Nevertheless, such an approach may be limited, for example, by the availability of RWD that matches the target trials of interest, and further investigations are warranted.

13.
Environ Res ; 197: 111185, 2021 06.
Article in English | MEDLINE | ID: mdl-33901445

ABSTRACT

An individual's health and conditions are associated with a complex interplay between the individual's genetics and his or her exposures to both internal and external environments. Much attention has been placed on characterizing of the genome in the past; nevertheless, genetics only account for about 10% of an individual's health conditions, while the remaining appears to be determined by environmental factors and gene-environment interactions. To comprehensively understand the causes of diseases and prevent them, environmental exposures, especially the external exposome, need to be systematically explored. However, the heterogeneity of the external exposome data sources (e.g., same exposure variables using different nomenclature in different data sources, or vice versa, two variables have the same or similar name but measure different exposures in reality) increases the difficulty of analyzing and understanding the associations between environmental exposures and health outcomes. To solve the issue, the development of semantic standards using an ontology-driven approach is inevitable because ontologies can (1) provide a unambiguous and consistent understanding of the variables in heterogeneous data sources, and (2) explicitly express and model the context of the variables and relationships between those variables. We conducted a review of existing ontology for the external exposome and found only four relevant ontologies. Further, the four existing ontologies are limited: they (1) often ignored the spatiotemporal characteristics of external exposome data, and (2) were developed in isolation from other conceptual frameworks (e.g., the socioecological model and the social determinants of health). Moving forward, the combination of multi-domain and multi-scale data (i.e., genome, phenome and exposome at different granularity) and different conceptual frameworks is the basis of health outcomes research in the future.


Subject(s)
Exposome , Causality , Environmental Exposure , Female , Humans , Male , Semantics
14.
JAMIA Open ; 4(1): ooab026, 2021 Jan.
Article in English | MEDLINE | ID: mdl-33855274

ABSTRACT

OBJECTIVE: Dietary supplements are widely used. However, dietary supplements are not always safe. For example, an estimated 23 000 emergency room visits every year in the United States were attributed to adverse events related to dietary supplement use. With the rapid development of the Internet, consumers usually seek health information including dietary supplement information online. To help consumers access quality online dietary supplement information, we have identified trustworthy dietary supplement information sources and built an evidence-based knowledge base of dietary supplement information-the integrated DIetary Supplement Knowledge base (iDISK) that integrates and standardizes dietary supplement related information across these different sources. However, as information in iDISK was collected from scientific sources, the complex medical jargon is a barrier for consumers' comprehension. The objective of this study is to assess how different approaches to simplify and represent dietary supplement information from iDISK will affect lay consumers' comprehension. MATERIALS AND METHODS: Using a crowdsourcing platform, we recruited participants to read dietary supplement information in 4 different representations from iDISK: (1) original text, (2) syntactic and lexical text simplification (TS), (3) manual TS, and (4) a graph-based visualization. We then assessed how the different simplification and representation strategies affected consumers' comprehension of dietary supplement information in terms of accuracy and response time to a set of comprehension questions. RESULTS: With responses from 690 qualified participants, our experiments confirmed that the manual approach, as expected, had the best performance for both accuracy and response time to the comprehension questions, while the graph-based approach ranked the second outperforming other representations. In some cases, the graph-based representation outperformed the manual approach in terms of response time. CONCLUSIONS: A hybrid approach that combines text and graph-based representations might be needed to accommodate consumers' different information needs and information seeking behavior.

15.
AMIA Annu Symp Proc ; 2021: 716-725, 2021.
Article in English | MEDLINE | ID: mdl-35308944

ABSTRACT

Recently, there has been a growing interest in using real-world data (RWD) to generate real-world evidence that complements clinical trials. To quantify treatment effects, it is important to develop meaningful RWD-based endpoints. In cancer trials, two real-world endpoints are of particular interest: real-world overall survival (rwOS) and real-world time to next treatment (rwTTNT). In this work, we identified ways to calculate these real-world endpoints with structured electronic health record (EHR) data and validate these endpoints against the gold-standard measurements of these endpoints derived from linked EHR and tumor registry (TR) data. In addition, we examined and reported data quality issues, especially inconsistencies between the EHR and TR data. Using a survival model, we show that the presence of next treatment was not significantly associated with rwOS, but patients who had longer rwTTNT had longer rwOS, validating the use of rwTTNT as a real-world surrogate marker for measuring cancer endpoints.


Subject(s)
Neoplasms , Data Accuracy , Electronic Health Records , Humans , Neoplasms/therapy , Registries , Treatment Outcome
16.
JMIR Med Inform ; 8(12): e22982, 2020 Dec 15.
Article in English | MEDLINE | ID: mdl-33320104

ABSTRACT

BACKGROUND: Patients' family history (FH) is a critical risk factor associated with numerous diseases. However, FH information is not well captured in the structured database but often documented in clinical narratives. Natural language processing (NLP) is the key technology to extract patients' FH from clinical narratives. In 2019, the National NLP Clinical Challenge (n2c2) organized shared tasks to solicit NLP methods for FH information extraction. OBJECTIVE: This study presents our end-to-end FH extraction system developed during the 2019 n2c2 open shared task as well as the new transformer-based models that we developed after the challenge. We seek to develop a machine learning-based solution for FH information extraction without task-specific rules created by hand. METHODS: We developed deep learning-based systems for FH concept extraction and relation identification. We explored deep learning models including long short-term memory-conditional random fields and bidirectional encoder representations from transformers (BERT) as well as developed ensemble models using a majority voting strategy. To further optimize performance, we systematically compared 3 different strategies to use BERT output representations for relation identification. RESULTS: Our system was among the top-ranked systems (3 out of 21) in the challenge. Our best system achieved micro-averaged F1 scores of 0.7944 and 0.6544 for concept extraction and relation identification, respectively. After challenge, we further explored new transformer-based models and improved the performances of both subtasks to 0.8249 and 0.6775, respectively. For relation identification, our system achieved a performance comparable to the best system (0.6810) reported in the challenge. CONCLUSIONS: This study demonstrated the feasibility of utilizing deep learning methods to extract FH information from clinical narratives.

17.
BMC Med Inform Decis Mak ; 20(Suppl 4): 292, 2020 12 14.
Article in English | MEDLINE | ID: mdl-33317497

ABSTRACT

BACKGROUND: To reduce cancer mortality and improve cancer outcomes, it is critical to understand the various cancer risk factors (RFs) across different domains (e.g., genetic, environmental, and behavioral risk factors) and levels (e.g., individual, interpersonal, and community levels). However, prior research on RFs of cancer outcomes, has primarily focused on individual level RFs due to the lack of integrated datasets that contain multi-level, multi-domain RFs. Further, the lack of a consensus and proper guidance on systematically identify RFs also increase the difficulty of RF selection from heterogenous data sources in a multi-level integrative data analysis (mIDA) study. More importantly, as mIDA studies require integrating heterogenous data sources, the data integration processes in the limited number of existing mIDA studies are inconsistently performed and poorly documented, and thus threatening transparency and reproducibility. METHODS: Informed by the National Institute on Minority Health and Health Disparities (NIMHD) research framework, we (1) reviewed existing reporting guidelines from the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) network and (2) developed a theory-driven reporting guideline to guide the RF variable selection, data source selection, and data integration process. Then, we developed an ontology to standardize the documentation of the RF selection and data integration process in mIDA studies. RESULTS: We summarized the review results and created a reporting guideline-ATTEST-for reporting the variable selection and data source selection and integration process. We provided an ATTEST check list to help researchers to annotate and clearly document each step of their mIDA studies to ensure the transparency and reproducibility. We used the ATTEST to report two mIDA case studies and further transformed annotation results into sematic triples, so that the relationships among variables, data sources and integration processes are explicitly standardized and modeled using the classes and properties from OD-ATTEST. CONCLUSION: Our ontology-based reporting guideline solves some key challenges in current mIDA studies for cancer outcomes research, through providing (1) a theory-driven guidance for multi-level and multi-domain RF variable and data source selection; and (2) a standardized documentation of the data selection and integration processes powered by an ontology, thus a way to enable sharing of mIDA study reports among researchers.


Subject(s)
Neoplasms , Documentation , Humans , Information Storage and Retrieval , Neoplasms/genetics , Outcome Assessment, Health Care , Reproducibility of Results
18.
JMIR Med Inform ; 8(11): e19735, 2020 Nov 23.
Article in English | MEDLINE | ID: mdl-33226350

ABSTRACT

BACKGROUND: Semantic textual similarity (STS) is one of the fundamental tasks in natural language processing (NLP). Many shared tasks and corpora for STS have been organized and curated in the general English domain; however, such resources are limited in the biomedical domain. In 2019, the National NLP Clinical Challenges (n2c2) challenge developed a comprehensive clinical STS dataset and organized a community effort to solicit state-of-the-art solutions for clinical STS. OBJECTIVE: This study presents our transformer-based clinical STS models developed during this challenge as well as new models we explored after the challenge. This project is part of the 2019 n2c2/Open Health NLP shared task on clinical STS. METHODS: In this study, we explored 3 transformer-based models for clinical STS: Bidirectional Encoder Representations from Transformers (BERT), XLNet, and Robustly optimized BERT approach (RoBERTa). We examined transformer models pretrained using both general English text and clinical text. We also explored using a general English STS dataset as a supplementary corpus in addition to the clinical training set developed in this challenge. Furthermore, we investigated various ensemble methods to combine different transformer models. RESULTS: Our best submission based on the XLNet model achieved the third-best performance (Pearson correlation of 0.8864) in this challenge. After the challenge, we further explored other transformer models and improved the performance to 0.9065 using a RoBERTa model, which outperformed the best-performing system developed in this challenge (Pearson correlation of 0.9010). CONCLUSIONS: This study demonstrated the efficiency of utilizing transformer-based models to measure semantic similarity for clinical text. Our models can be applied to clinical applications such as clinical text deduplication and summarization.

19.
J Biomed Inform ; 110: 103571, 2020 10.
Article in English | MEDLINE | ID: mdl-32961307

ABSTRACT

BACKGROUND: One in five U.S. adults lives with some kind of mental health condition and 4.6% of all U.S. adults have a serious mental illness. The Internet has become the first place for these people to seek online mental health information for help. However, online mental health information is not well-organized and often of low quality. There have been efforts in building evidence-based mental health knowledgebases curated with information manually extracted from the high-quality scientific literature. Manual extraction is inefficient. Crowdsourcing can potentially be a low-cost mechanism to collect labeled data from non-expert laypeople. However, there is not an existing annotation tool integrated with popular crowdsourcing platforms to perform the information extraction tasks. In our previous work, we prototyped a Semantic Text Annotation Tool (STAT) to address this gap. OBJECTIVE: We aimed to refine the STAT prototype (1) to improve its usability and (2) to enhance the crowdsourcing workflow efficiency to facilitate the construction of evidence-based mental health knowledgebase, following a user-centered design (UCD) approach. METHODS: Following UCD principles, we conducted four design iterations to improve the initial STAT prototype. In the first two iterations, usability testing focus groups were conducted internally with 8 participants recruited from a convenient sample, and the usability was evaluated with a modified System Usability Scale (SUS). In the following two iterations, usability testing was conducted externally using the Amazon Mechanical Turk (MTurk) platform. In each iteration, we summarized the usability testing results through thematic analysis, identified usability issues, and conducted a heuristic evaluation to map identified usability issues to Jakob Nielsen's usability heuristics. We collected suggested improvements in the usability testing sessions and enhanced STAT accordingly in the next UCD iteration. After four UCD iterations, we conducted a case study of the system on MTurk using mental health related scientific literature. We compared the performance of crowdsourcing workers with two expert annotators from two aspects: efficiency and quality. RESULTS: The SUS score increased from 70.3 ± 12.5 to 81.1 ± 9.8 after the two internal UCD iterations as we improved STAT's functionality based on the suggested improvements. We then evaluated STAT externally through MTurk in the following two iterations. The SUS score decreased to 55.7 ± 20.1 in the third iteration, probably because of the complexity of the tasks. After further simplification of STAT and the annotation tasks with an improved annotation guideline, the SUS score increased to 73.8 ± 13.8 in the fourth iteration of UCD. In the evaluation case study, on average, the workers spent 125.5 ± 69.2 s on the onboarding tutorial and the crowdsourcing workers spent significantly less time on the annotation tasks compared to the two experts. In terms of annotation quality, the workers' annotation results achieved average F1-scores ranged from 0.62 to 0.84 for the different sentences. CONCLUSIONS: We successfully developed a web-based semantic text annotation tool, STAT, to facilitate the curation of semantic web knowledgebases through four UCD iterations. The lessons learned from the UCD process could serve as a guide to further enhance STAT and the development and design of other crowdsourcing-based semantic text annotation tasks. Our study also showed that a well-organized, informative annotation guideline is as important as the annotation tool itself. Further, we learned that a crowdsourcing task should consist of multiple simple microtasks rather than a complicated task.


Subject(s)
Crowdsourcing , Adult , Humans , Internet , Knowledge Bases , Mental Health , Semantics , User-Centered Design
20.
Onco Targets Ther ; 13: 5223-5230, 2020.
Article in English | MEDLINE | ID: mdl-32606742

ABSTRACT

PURPOSE: Hepatocellular carcinoma (HCC) is one of the most malignant cancers around the world. HCC is less sensitive to conventional cytotoxic agents and easily develops into systemic metastases. However, the molecular mechanisms of the metastasis of HCC are poorly understood and need elucidation. MATERIALS AND METHODS: Transwell system of the chemotherapy-challenged and unchallenged HepG2 cells was established. Adhesion assay and scratch-wound assay were utilized to analyze the adhesion and migration of HepG2 cells. iPLA2 and LOX-5 expression were analyzed by Western blot. LTB4 level was analyzed by ELISA. RESULTS: Chemotherapeutics are traditionally regarded as a way of killing tumor cells; on the other hand, we proved that the chemotherapeutics-induced tumor cell apoptosis can also change the tumor microenvironment by activating the LOX pathway and subsequently release inflammatory factors such as LTB4 which can stimulate the adhesion and migration of the small number of surviving cells. Berberine can reverse the adhesion and migration of HepG2 cells by inhibiting the expression of LOX-5 and reducing the LTB4 production in the tumor microenvironment. CONCLUSION: Our study sheds light on a novel anti-metastasis strategy that the combination of Berberine and chemotherapy may prevent the chemotherapy-induced metastasis in HCC.

SELECTION OF CITATIONS
SEARCH DETAIL
...