Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
1.
BMC Med Inform Decis Mak ; 23(Suppl 4): 298, 2024 01 05.
Article in English | MEDLINE | ID: mdl-38183034

ABSTRACT

BACKGROUND: Vaccine Adverse Events ReportingSystem (VAERS) is a promising resource of tracking adverse events following immunization. Medical Dictionary for Regulatory Activities (MedDRA) terminology used for coding adverse events in VAERS reports has several limitations. We focus on developing an automated system for semantic extraction of adverse events following vaccination and their temporal relationships for a better understanding of VAERS data and its integration into other applications. The aim of the present studyis to summarize the lessons learned during the initial phase of this project in annotating adverse events following influenza vaccination and related to Guillain-Barré syndrome (GBS). We emphasize on identifying the limitations of VAERS and MedDRA. RESULTS: We collected 282 VAERS reports documented between 1990 and 2016 and shortlisted those with at least 1,100 characters in the report. We used a subset of 50 reports for the preliminary investigation and annotated all adverse events following influenza vaccination by mapping to representative MedDRA terms. Associated time expressions were annotated when available. We used 16 System Organ Class (SOC) level MedDRA terms to map GBS related adverse events and expanded some SOC terms to Lowest Level Terms (LLT) for granular representation. We annotated three broad categories of events such as problems, clinical investigations, and treatments/procedures. The inter-annotator agreement of events achieved was 86%. Incomplete reports, typographical errors, lack of clarity and coherence, repeated texts, unavailability of associated temporal information, difficulty to interpret due to incorrect grammar, use of generalized terms to describe adverse events / symptoms, uncommon abbreviations, difficulty annotating multiple events with a conjunction / common phrase, irrelevant historical events and coexisting events were some of the challenges encountered. Some of the limitations we noted are in agreement with previous reports. CONCLUSIONS: We reported the challenges encountered and lessons learned during annotation of adverse events in VAERS reports following influenza vaccination and related to GBS. Though the challenges may be due to the inevitable limitations of public reporting systems and widely reported limitations of MedDRA, we emphasize the need to understand these limitations and extraction of other supportive information for a better understanding of adverse events following vaccination.


Subject(s)
Guillain-Barre Syndrome , Influenza, Human , Humans , Guillain-Barre Syndrome/etiology , Adverse Drug Reaction Reporting Systems , Influenza, Human/prevention & control , Vaccination/adverse effects , Linguistics
2.
J Med Internet Res ; 20(7): e236, 2018 07 09.
Article in English | MEDLINE | ID: mdl-29986843

ABSTRACT

BACKGROUND: Timely understanding of public perceptions allows public health agencies to provide up-to-date responses to health crises such as infectious diseases outbreaks. Social media such as Twitter provide an unprecedented way for the prompt assessment of the large-scale public response. OBJECTIVE: The aims of this study were to develop a scheme for a comprehensive public perception analysis of a measles outbreak based on Twitter data and demonstrate the superiority of the convolutional neural network (CNN) models (compared with conventional machine learning methods) on measles outbreak-related tweets classification tasks with a relatively small and highly unbalanced gold standard training set. METHODS: We first designed a comprehensive scheme for the analysis of public perception of measles based on tweets, including 3 dimensions: discussion themes, emotions expressed, and attitude toward vaccination. All 1,154,156 tweets containing the word "measles" posted between December 1, 2014, and April 30, 2015, were purchased and downloaded from DiscoverText.com. Two expert annotators curated a gold standard of 1151 tweets (approximately 0.1% of all tweets) based on the 3-dimensional scheme. Next, a tweet classification system based on the CNN framework was developed. We compared the performance of the CNN models to those of 4 conventional machine learning models and another neural network model. We also compared the impact of different word embeddings configurations for the CNN models: (1) Stanford GloVe embedding trained on billions of tweets in the general domain, (2) measles-specific embedding trained on our 1 million measles related tweets, and (3) a combination of the 2 embeddings. RESULTS: Cohen kappa intercoder reliability values for the annotation were: 0.78, 0.72, and 0.80 on the 3 dimensions, respectively. Class distributions within the gold standard were highly unbalanced for all dimensions. The CNN models performed better on all classification tasks than k-nearest neighbors, naïve Bayes, support vector machines, or random forest. Detailed comparison between support vector machines and the CNN models showed that the major contributor to the overall superiority of the CNN models is the improvement on recall, especially for classes with low occurrence. The CNN model with the 2 embedding combination led to better performance on discussion themes and emotions expressed (microaveraging F1 scores of 0.7811 and 0.8592, respectively), while the CNN model with Stanford embedding achieved best performance on attitude toward vaccination (microaveraging F1 score of 0.8642). CONCLUSIONS: The proposed scheme can successfully classify the public's opinions and emotions in multiple dimensions, which would facilitate the timely understanding of public perceptions during the outbreak of an infectious disease. Compared with conventional machine learning methods, our CNN models showed superiority on measles-related tweet classification tasks with a relatively small and highly unbalanced gold standard. With the success of these tasks, our proposed scheme and CNN-based tweets classification system is expected to be useful for the analysis of tweets about other infectious diseases such as influenza and Ebola.


Subject(s)
Disease Outbreaks/statistics & numerical data , Measles/epidemiology , Neural Networks, Computer , Social Media/trends , History, 21st Century , Humans , Measles/pathology , Perception , Public Opinion , Reproducibility of Results
3.
BMC Med Inform Decis Mak ; 17(Suppl 2): 73, 2017 Jul 05.
Article in English | MEDLINE | ID: mdl-28699547

ABSTRACT

BACKGROUND: Knowledge engineering for ontological knowledgebases is resource and time intensive. To alleviate these issues, especially for novices, automated tools from the natural language domain can assist in the development process of ontologies. We focus towards the development of ontologies for the public health domain and use patient-centric sources from MedlinePlus related to HPV-causing cancers. METHODS: This paper demonstrates the use of a lightweight open information extraction (OIE) tool to derive accurate knowledge triples that can lead to the seeding of an ontological knowledgebase. We developed a custom application, which interfaced with an information extraction software library, to help facilitate the tasks towards producing knowledge triples from textual sources. RESULTS: The results of our efforts generated accurate extractions ranging from 80-89% precision. These triples can later be transformed to OWL/RDF representation for our planned ontological knowledgebase. CONCLUSIONS: OIE delivers an effective and accessible method towards the development ontologies.


Subject(s)
Biological Ontologies , MedlinePlus , Natural Language Processing , Neoplasms , Public Health , Humans
4.
BMC Med Inform Decis Mak ; 17(Suppl 2): 69, 2017 Jul 05.
Article in English | MEDLINE | ID: mdl-28699569

ABSTRACT

BACKGROUND: As one of the serious public health issues, vaccination refusal has been attracting more and more attention, especially for newly approved human papillomavirus (HPV) vaccines. Understanding public opinion towards HPV vaccines, especially concerns on social media, is of significant importance for HPV vaccination promotion. METHODS: In this study, we leveraged a hierarchical machine learning based sentiment analysis system to extract public opinions towards HPV vaccines from Twitter. English tweets containing HPV vaccines-related keywords were collected from November 2, 2015 to March 28, 2016. Manual annotation was done to evaluate the performance of the system on the unannotated tweets corpus. Followed time series analysis was applied to this corpus to track the trends of machine-deduced sentiments and their associations with different days of the week. RESULTS: The evaluation of the unannotated tweets corpus showed that the micro-averaging F scores have reached 0.786. The learning system deduced the sentiment labels for 184,214 tweets in the collected unannotated tweets corpus. Time series analysis identified a coincidence between mainstream outcome and Twitter contents. A weak trend was found for "Negative" tweets that decreased firstly and began to increase later; an opposite trend was identified for "Positive" tweets. Tweets that contain the worries on efficacy for HPV vaccines showed a relative significant decreasing trend. Strong associations were found between some sentiments ("Positive", "Negative", "Negative-Safety" and "Negative-Others") with different days of the week. CONCLUSIONS: Our efforts on sentiment analysis for newly approved HPV vaccines provide us an automatic and instant way to extract public opinion and understand the concerns on Twitter. Our approaches can provide a feedback to public health professionals to monitor online public response, examine the effectiveness of their HPV vaccination promotion strategies and adjust their promotion plans.


Subject(s)
Health Knowledge, Attitudes, Practice , Machine Learning , Papillomavirus Infections/prevention & control , Papillomavirus Vaccines , Public Opinion , Social Media , Vaccination/psychology , Humans
5.
JMIR Serious Games ; 9(1): e23088, 2021 Jan 27.
Article in English | MEDLINE | ID: mdl-33502323

ABSTRACT

BACKGROUND: Early adolescent unintended pregnancy and sexually transmitted infection prevention are significant public health challenges in the United States. Parental influence can help adolescents make responsible and informed sexual health decisions toward delayed sexual debut; yet parents often feel ill equipped to communicate about sex-related topics. Intergenerational games offer a potential strategy to provide life skills training to young adolescents (aged 11-14 years) while engaging them and their parents in communication about sexual health. OBJECTIVE: This study aims to describe the development of a web-based online sexual health intergenerational adventure game, the Secret of Seven Stones (SSS), using an intervention mapping (IM) approach for developing theory- and evidence-based interventions. METHODS: We followed the IM development steps to describe a theoretical and empirical model for young adolescent sexual health behavior, define target behaviors and change objectives, identify theory-based methods and practical applications to inform design and function, develop and test a prototype of 2 game levels to assess feasibility before developing the complete 18-level game, draft an implementation plan that includes a commercial dissemination strategy, and draft an evaluation plan including a study design for a randomized controlled trial efficacy trial of SSS. RESULTS: SSS comprised an adventure game for young adolescent skills training delivered via a desktop computer, a text-based notification system to provide progress updates for parents and cues to initiate dialogue with their 11- to 14-year-old child, and a website for parent skills training and progress monitoring. Formative prototype testing demonstrated feasibility for in-home use and positive usability ratings. CONCLUSIONS: The SSS intergenerational game provides a unique addition to the limited cadre of home-based programs that facilitate parent involvement in influencing young adolescent behaviors and reducing adolescent sexual risk taking. The IM framework provided a logical and thorough approach to development and testing, attentive to the need for theoretical and empirical foundations in serious games for health.

6.
J Am Med Inform Assoc ; 27(7): 1046-1056, 2020 07 01.
Article in English | MEDLINE | ID: mdl-32626903

ABSTRACT

OBJECTIVE: The goal of this study is to develop a robust Time Event Ontology (TEO), which can formally represent and reason both structured and unstructured temporal information. MATERIALS AND METHODS: Using our previous Clinical Narrative Temporal Relation Ontology 1.0 and 2.0 as a starting point, we redesigned concept primitives (clinical events and temporal expressions) and enriched temporal relations. Specifically, 2 sets of temporal relations (Allen's interval algebra and a novel suite of basic time relations) were used to specify qualitative temporal order relations, and a Temporal Relation Statement was designed to formalize quantitative temporal relations. Moreover, a variety of data properties were defined to represent diversified temporal expressions in clinical narratives. RESULTS: TEO has a rich set of classes and properties (object, data, and annotation). When evaluated with real electronic health record data from the Mayo Clinic, it could faithfully represent more than 95% of the temporal expressions. Its reasoning ability was further demonstrated on a sample drug adverse event report annotated with respect to TEO. The results showed that our Java-based TEO reasoner could answer a set of frequently asked time-related queries, demonstrating that TEO has a strong capability of reasoning complex temporal relations. CONCLUSION: TEO can support flexible temporal relation representation and reasoning. Our next step will be to apply TEO to the natural language processing field to facilitate automated temporal information annotation, extraction, and timeline reasoning to better support time-based clinical decision-making.


Subject(s)
Biological Ontologies , Electronic Health Records , Time , Decision Support Systems, Clinical , Humans , Natural Language Processing , Semantic Web
7.
Stud Health Technol Inform ; 264: 1041-1045, 2019 Aug 21.
Article in English | MEDLINE | ID: mdl-31438083

ABSTRACT

Natural language processing (NLP) technologies have been successfully applied to cancer research by enabling automated phenotypic information extraction from narratives in electronic health records (EHRs) such as pathology reports; however, developing customized NLP solutions requires substantial effort. To facilitate the adoption of NLP in cancer research, we have developed a set of customizable modules for extracting comprehensive types of cancer-related information in pathology reports (e.g., tumor size, tumor stage, and biomarkers), by leveraging the existing CLAMP system, which provides user-friendly interfaces for building customized NLP solutions for individual needs. Evaluation using annotated data at Vanderbilt University Medical Center showed that CLAMP-Cancer could extract diverse types of cancer information with good F-measures (0.80-0.98). We then applied CLAMP-Cancer to an information extraction task at Mayo Clinic and showed that we can quickly build a customized NLP system with comparable performance with an existing system at Mayo Clinic. CLAMP-Cancer is freely available for academic use.


Subject(s)
Information Storage and Retrieval , Neoplasms , Electronic Health Records , Humans , Natural Language Processing , Research Report
8.
JMIR Med Inform ; 6(1): e7, 2018 Feb 22.
Article in English | MEDLINE | ID: mdl-29472179

ABSTRACT

BACKGROUND: Today, there is an increasing need to centralize and standardize electronic health data within clinical research as the volume of data continues to balloon. Domain-specific common data elements (CDEs) are emerging as a standard approach to clinical research data capturing and reporting. Recent efforts to standardize clinical study CDEs have been of great benefit in facilitating data integration and data sharing. The importance of the temporal dimension of clinical research studies has been well recognized; however, very few studies have focused on the formal representation of temporal constraints and temporal relationships within clinical research data in the biomedical research community. In particular, temporal information can be extremely powerful to enable high-quality cancer research. OBJECTIVE: The objective of the study was to develop and evaluate an ontological approach to represent the temporal aspects of cancer study CDEs. METHODS: We used CDEs recorded in the National Cancer Institute (NCI) Cancer Data Standards Repository (caDSR) and created a CDE parser to extract time-relevant CDEs from the caDSR. Using the Web Ontology Language (OWL)-based Time Event Ontology (TEO), we manually derived representative patterns to semantically model the temporal components of the CDEs using an observing set of randomly selected time-related CDEs (n=600) to create a set of TEO ontological representation patterns. In evaluating TEO's ability to represent the temporal components of the CDEs, this set of representation patterns was tested against two test sets of randomly selected time-related CDEs (n=425). RESULTS: It was found that 94.2% (801/850) of the CDEs in the test sets could be represented by the TEO representation patterns. CONCLUSIONS: In conclusion, TEO is a good ontological model for representing the temporal components of the CDEs recorded in caDSR. Our representative model can harness the Semantic Web reasoning and inferencing functionalities and present a means for temporal CDEs to be machine-readable, streamlining meaningful searches.

9.
Saf Health ; 1: 7, 2015.
Article in English | MEDLINE | ID: mdl-38770193

ABSTRACT

Background: To learn from errors, electronic patient safety event reporting systems (e-reporting systems) have been widely adopted to collect medical incidents from the frontline practitioners in US hospitals. However, two issues of underreporting and low-quality of reports pervade and thus the system effectiveness remains dubious. Methods: This study employing semi-structured interviews of health professionals in the Texas Medical Center investigated the perceived benefits and barriers from users who have used e-reporting systems. Results: As a result, the perceived benefits include the enhanced convenience in data processing and the assistant functions leading to patient safety enhancement. The perceived barriers to the acceptance and quality use of the system include the lack of instructions, lack of reporter-friendly classifications, lack of time, and lack of feedback The identified benefits and barriers help design a user-centered e-reporting system where learning and assistant features are discussed during the interviews. Conclusions: As a response, the learning and assistant features aiming at enhancing benefits and removing barriers of e-reporting systems should be included for facilitating the acceptance and effective use of the systems.

SELECTION OF CITATIONS
SEARCH DETAIL