Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
1.
PLoS Comput Biol ; 18(6): e1010144, 2022 06.
Artigo em Inglês | MEDLINE | ID: mdl-35704662

RESUMO

Analysis of health-related texts can be used to detect adverse drug reactions (ADR). The greatest challenge for ADR detection lies in imbalanced data distributions where words related to ADR symptoms are often minority classes. As a result, trained models tend to converge to a point that strongly biases towards the majority class and then ignores the minority class. Since the most used cross-entropy criteria is an approximation to accuracy, the model focuses more readily on the majority class to achieve high accuracy. To address this issue, existing methods apply either oversampling or down-sampling strategies to balance the data distribution and exploit the most difficult samples of the minority class. However, increasing or reducing the number of individual tokens alone in sequence labeling tasks will result in the loss of the syntactic relations of the sentence. This paper proposes a weighted variant of conditional random field (CRF) for data-imbalanced sequence labeling tasks. Such a weighting strategy can alleviate data distribution imbalances between majority and minority classes. Instead of using softmax in the output layer, the CRF can capture the relationship of labels between tokens. The locally interpretable model-agnostic explanations (LIME) algorithm was applied to investigate performance differences between models with and without the weighted loss function. Experimental results on two different ADR tasks show that the proposed model outperforms previously proposed sequence labeling methods.


Assuntos
Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Algoritmos , Coleta de Dados , Humanos
2.
BMC Med Inform Decis Mak ; 19(1): 173, 2019 08 28.
Artigo em Inglês | MEDLINE | ID: mdl-31455389

RESUMO

BACKGROUND: Feelings of depression can be caused by negative life events (NLE) such as the death of a family member, a quarrel with one's spouse, job loss, or strong criticism from an authority figure. The automatic and accurate identification of negative life event language patterns (NLE-LP) can help identify individuals potentially in need of psychiatric services. An NLE-LP combines a person (subject) and a reasonable negative life event (action), e.g. or < boyfriend:break_up>. METHODS: This paper proposes an analogical reasoning framework which combines a word representation approach and a pattern inference method to mine/extract NLE-LPs from psychiatric consultation documents. Word representation approaches such as skip-gram (SG) and continuous bag-of-words (CBOW) are used to generate word embeddings. Pattern inference methods such as cosine similarity (COSINE) and cosine multiplication similarity (COSMUL) are used to infer patterns. RESULTS: Experimental results show our proposed analogical reasoning framework outperforms the traditional methods such as positive pairwise mutual information (PPMI) and hyperspace analog to language (HAL), and can effectively mine highly precise NLE-LPs based on word embeddings. CBOW with COSINE of analogical reasoning is the best word representation and inference engine. In addition, both word embeddings and the inference engine provided by the analogical reasoning framework can further be used to improve the HAL model. CONCLUSIONS: Our proposed framework is a very simple matching function based on these word representation approaches and is applied to significantly improve HAL model mining performance.


Assuntos
Idioma , Acontecimentos que Mudam a Vida , Resolução de Problemas , Tomada de Decisão Clínica , Humanos , Serviços de Saúde Mental , Avaliação das Necessidades
3.
BMC Med Inform Decis Mak ; 12: 72, 2012 Jul 18.
Artigo em Inglês | MEDLINE | ID: mdl-22809317

RESUMO

BACKGROUND: Online psychiatric texts are natural language texts expressing depressive problems, published by Internet users via community-based web services such as web forums, message boards and blogs. Understanding the cause-effect relations embedded in these psychiatric texts can provide insight into the authors' problems, thus increasing the effectiveness of online psychiatric services. METHODS: Previous studies have proposed the use of word pairs extracted from a set of sentence pairs to identify cause-effect relations between sentences. A word pair is made up of two words, with one coming from the cause text span and the other from the effect text span. Analysis of the relationship between these words can be used to capture individual word associations between cause and effect sentences. For instance, (broke up, life) and (boyfriend, meaningless) are two word pairs extracted from the sentence pair: "I broke up with my boyfriend. Life is now meaningless to me". The major limitation of word pairs is that individual words in sentences usually cannot reflect the exact meaning of the cause and effect events, and thus may produce semantically incomplete word pairs, as the previous examples show. Therefore, this study proposes the use of inter-sentential language patterns such as ≪broke up, boyfriend>,

Assuntos
Causalidade , Transtorno Depressivo/epidemiologia , Semântica , Mineração de Dados/métodos , Bases de Dados Factuais , Transtorno Depressivo/prevenção & controle , Humanos , Armazenamento e Recuperação da Informação/tendências , Internet , Acontecimentos que Mudam a Vida
4.
J Biomed Inform ; 44(4): 509-18, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21292030

RESUMO

PURPOSE: Negative life events, such as the death of a family member, an argument with a spouse or the loss of a job, play an important role in triggering depressive episodes. Therefore, it is worthwhile to develop psychiatric services that can automatically identify such events. This study describes the use of association language patterns, i.e., meaningful combinations of words (e.g., ), as features to classify sentences with negative life events into predefined categories (e.g., Family, Love, Work). METHODS: This study proposes a framework that combines a supervised data mining algorithm and an unsupervised distributional semantic model to discover association language patterns. The data mining algorithm, called association rule mining, was used to generate a set of seed patterns by incrementally associating frequently co-occurring words from a small corpus of sentences labeled with negative life events. The distributional semantic model was then used to discover more patterns similar to the seed patterns from a large, unlabeled web corpus. RESULTS: The experimental results showed that association language patterns were significant features for negative life event classification. Additionally, the unsupervised distributional semantic model was not only able to improve the level of performance but also to reduce the reliance of the classification process on the availability of a large, labeled corpus.


Assuntos
Mineração de Dados/métodos , Transtorno Depressivo/prevenção & controle , Acontecimentos que Mudam a Vida , Modelos Teóricos , Processamento de Linguagem Natural , Semântica , Algoritmos , Teorema de Bayes , Bases de Dados Factuais , Transtorno Depressivo/diagnóstico , Humanos , Internet , Apoio Social
5.
IEEE Trans Inf Technol Biomed ; 11(4): 415-27, 2007 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-17674624

RESUMO

Psychiatric consultation record retrieval attempts to help people to efficiently and effectively locate the consultation records relevant to their depressive problems. Consultation records can also make people aware that they are not alone, because many individuals have suffered from the same or similar problems. Additionally, people can understand how to alleviate their depressive symptoms according to recommendations from health professionals. To achieve this goal, this paper proposes the use of a scenario-based representation, i.e., a symptom-based structural representation, to capture the depressive symptoms and their semantic relations, such as cause-effect and temporal relations, for understanding users' queries clearly. The symptoms and relations are identified from semantic mining and analysis of consultation records. The multilevel mixture model is adopted to estimate the relevance of queries and consultation records based on the structural information. Experimental results show that the proposed approach achieves higher precision than does a term-based flat representation. An experiment is also conducted to examine the effect of error propagation resulting from incorrect identification of symptoms and relations. Experimental results demonstrate that combining different approaches can improve the retrieval robustness.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Armazenamento e Recuperação da Informação/métodos , Anamnese/métodos , Sistemas Computadorizados de Registros Médicos , Processamento de Linguagem Natural , Psiquiatria/métodos , Encaminhamento e Consulta , Inteligência Artificial , Técnicas de Apoio para a Decisão , Medical Subject Headings
6.
Artigo em Inglês | MEDLINE | ID: mdl-30613241

RESUMO

Massive open online courses (MOOCs) have recently gained worldwide attention from educational institutes. MOOCs provide a new option for learning, yet measurable learning benefits of MOOCs still need to be investigated. Collecting data of three MOOCs at Yuan Ze University (YZU), this paper intended to classify learning behaviors among 1489 students on the MOOC platform at YZU. This study further examined learning outcomes in MOOCs by different types of learners. The Ward's hierarchical and k-means non-hierarchical clustering methods were employed to classify types of learners' behavior while they engaged in learning activities on the MOOC platform. Three types of MOOC learners were classified-active learner, passive learner, and bystander. Active learners who submitted assignments on time and frequently watched lecture videos showed a higher completion rate and a better grade in the course. MOOC learners who participated in online discussion forum reported a higher rate of passing the course and a better score than those inactive classmates. The finding of this study suggested that the first 2 weeks was a critical point of time to retain students in MOOCs. MOOC instructors need to carefully design course and detect risk behaviors of students in early of the classes to prevent students from dropping out of the course. The feature design of discussion forum is to provide peer interaction and facilitate online learning. Our results suggested that timely feedback by instructors or facilitators on discussion forum could enhance students' engagement in MOOCs.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA