RESUMO
Analysis of health-related texts can be used to detect adverse drug reactions (ADR). The greatest challenge for ADR detection lies in imbalanced data distributions where words related to ADR symptoms are often minority classes. As a result, trained models tend to converge to a point that strongly biases towards the majority class and then ignores the minority class. Since the most used cross-entropy criteria is an approximation to accuracy, the model focuses more readily on the majority class to achieve high accuracy. To address this issue, existing methods apply either oversampling or down-sampling strategies to balance the data distribution and exploit the most difficult samples of the minority class. However, increasing or reducing the number of individual tokens alone in sequence labeling tasks will result in the loss of the syntactic relations of the sentence. This paper proposes a weighted variant of conditional random field (CRF) for data-imbalanced sequence labeling tasks. Such a weighting strategy can alleviate data distribution imbalances between majority and minority classes. Instead of using softmax in the output layer, the CRF can capture the relationship of labels between tokens. The locally interpretable model-agnostic explanations (LIME) algorithm was applied to investigate performance differences between models with and without the weighted loss function. Experimental results on two different ADR tasks show that the proposed model outperforms previously proposed sequence labeling methods.
Assuntos
Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Algoritmos , Coleta de Dados , HumanosRESUMO
BACKGROUND: Feelings of depression can be caused by negative life events (NLE) such as the death of a family member, a quarrel with one's spouse, job loss, or strong criticism from an authority figure. The automatic and accurate identification of negative life event language patterns (NLE-LP) can help identify individuals potentially in need of psychiatric services. An NLE-LP combines a person (subject) and a reasonable negative life event (action), e.g.
Assuntos
Idioma , Acontecimentos que Mudam a Vida , Resolução de Problemas , Tomada de Decisão Clínica , Humanos , Serviços de Saúde Mental , Avaliação das NecessidadesRESUMO
BACKGROUND: Online psychiatric texts are natural language texts expressing depressive problems, published by Internet users via community-based web services such as web forums, message boards and blogs. Understanding the cause-effect relations embedded in these psychiatric texts can provide insight into the authors' problems, thus increasing the effectiveness of online psychiatric services. METHODS: Previous studies have proposed the use of word pairs extracted from a set of sentence pairs to identify cause-effect relations between sentences. A word pair is made up of two words, with one coming from the cause text span and the other from the effect text span. Analysis of the relationship between these words can be used to capture individual word associations between cause and effect sentences. For instance, (broke up, life) and (boyfriend, meaningless) are two word pairs extracted from the sentence pair: "I broke up with my boyfriend. Life is now meaningless to me". The major limitation of word pairs is that individual words in sentences usually cannot reflect the exact meaning of the cause and effect events, and thus may produce semantically incomplete word pairs, as the previous examples show. Therefore, this study proposes the use of inter-sentential language patterns such as âªbroke up, boyfriend>, Assuntos
Causalidade
, Transtorno Depressivo/epidemiologia
, Semântica
, Mineração de Dados/métodos
, Bases de Dados Factuais
, Transtorno Depressivo/prevenção & controle
, Humanos
, Armazenamento e Recuperação da Informação/tendências
, Internet
, Acontecimentos que Mudam a Vida
RESUMO
PURPOSE: Negative life events, such as the death of a family member, an argument with a spouse or the loss of a job, play an important role in triggering depressive episodes. Therefore, it is worthwhile to develop psychiatric services that can automatically identify such events. This study describes the use of association language patterns, i.e., meaningful combinations of words (e.g.,
Assuntos
Mineração de Dados/métodos , Transtorno Depressivo/prevenção & controle , Acontecimentos que Mudam a Vida , Modelos Teóricos , Processamento de Linguagem Natural , Semântica , Algoritmos , Teorema de Bayes , Bases de Dados Factuais , Transtorno Depressivo/diagnóstico , Humanos , Internet , Apoio SocialRESUMO
Psychiatric consultation record retrieval attempts to help people to efficiently and effectively locate the consultation records relevant to their depressive problems. Consultation records can also make people aware that they are not alone, because many individuals have suffered from the same or similar problems. Additionally, people can understand how to alleviate their depressive symptoms according to recommendations from health professionals. To achieve this goal, this paper proposes the use of a scenario-based representation, i.e., a symptom-based structural representation, to capture the depressive symptoms and their semantic relations, such as cause-effect and temporal relations, for understanding users' queries clearly. The symptoms and relations are identified from semantic mining and analysis of consultation records. The multilevel mixture model is adopted to estimate the relevance of queries and consultation records based on the structural information. Experimental results show that the proposed approach achieves higher precision than does a term-based flat representation. An experiment is also conducted to examine the effect of error propagation resulting from incorrect identification of symptoms and relations. Experimental results demonstrate that combining different approaches can improve the retrieval robustness.
Assuntos
Sistemas de Gerenciamento de Base de Dados , Armazenamento e Recuperação da Informação/métodos , Anamnese/métodos , Sistemas Computadorizados de Registros Médicos , Processamento de Linguagem Natural , Psiquiatria/métodos , Encaminhamento e Consulta , Inteligência Artificial , Técnicas de Apoio para a Decisão , Medical Subject HeadingsRESUMO
Massive open online courses (MOOCs) have recently gained worldwide attention from educational institutes. MOOCs provide a new option for learning, yet measurable learning benefits of MOOCs still need to be investigated. Collecting data of three MOOCs at Yuan Ze University (YZU), this paper intended to classify learning behaviors among 1489 students on the MOOC platform at YZU. This study further examined learning outcomes in MOOCs by different types of learners. The Ward's hierarchical and k-means non-hierarchical clustering methods were employed to classify types of learners' behavior while they engaged in learning activities on the MOOC platform. Three types of MOOC learners were classified-active learner, passive learner, and bystander. Active learners who submitted assignments on time and frequently watched lecture videos showed a higher completion rate and a better grade in the course. MOOC learners who participated in online discussion forum reported a higher rate of passing the course and a better score than those inactive classmates. The finding of this study suggested that the first 2 weeks was a critical point of time to retain students in MOOCs. MOOC instructors need to carefully design course and detect risk behaviors of students in early of the classes to prevent students from dropping out of the course. The feature design of discussion forum is to provide peer interaction and facilitate online learning. Our results suggested that timely feedback by instructors or facilitators on discussion forum could enhance students' engagement in MOOCs.