Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Nature ; 542(7639): 96-100, 2017 02 02.
Artículo en Inglés | MEDLINE | ID: mdl-28117439

RESUMEN

When faced with threat, the survival of an organism is contingent upon the selection of appropriate active or passive behavioural responses. Freezing is an evolutionarily conserved passive fear response that has been used extensively to study the neuronal mechanisms of fear and fear conditioning in rodents. However, rodents also exhibit active responses such as flight under natural conditions. The central amygdala (CEA) is a forebrain structure vital for the acquisition and expression of conditioned fear responses, and the role of specific neuronal sub-populations of the CEA in freezing behaviour is well-established. Whether the CEA is also involved in flight behaviour, and how neuronal circuits for active and passive fear behaviour interact within the CEA, are not yet understood. Here, using in vivo optogenetics and extracellular recordings of identified cell types in a behavioural model in which mice switch between conditioned freezing and flight, we show that active and passive fear responses are mediated by distinct and mutually inhibitory CEA neurons. Cells expressing corticotropin-releasing factor (CRF+) mediate conditioned flight, and activation of somatostatin-positive (SOM+) neurons initiates passive freezing behaviour. Moreover, we find that the balance between conditioned flight and freezing behaviour is regulated by means of local inhibitory connections between CRF+ and SOM+ neurons, indicating that the selection of appropriate behavioural responses to threat is based on competitive interactions between two defined populations of inhibitory neurons, a circuit motif allowing for rapid and flexible action selection.


Asunto(s)
Reacción de Fuga/fisiología , Miedo/fisiología , Miedo/psicología , Reacción Cataléptica de Congelación/fisiología , Inhibición Neural , Neuronas/fisiología , Animales , Núcleo Amigdalino Central/citología , Núcleo Amigdalino Central/fisiología , Hormona Liberadora de Corticotropina/metabolismo , Locomoción/fisiología , Masculino , Ratones , Ratones Endogámicos C57BL , Modelos Neurológicos , Vías Nerviosas , Optogenética , Somatostatina/metabolismo
2.
J Biomed Inform ; 75S: S28-S33, 2017 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-28602908

RESUMEN

De-identification of clinical narratives is one of the main obstacles to making healthcare free text available for research. In this paper we describe our experience in expanding and tailoring two existing tools as part of the 2016 CEGS N-GRID Shared Tasks Track 1, which evaluated de-identification methods on a set of psychiatric evaluation notes for up to 25 different types of Protected Health Information (PHI). The methods we used rely on machine learning on either a large or small feature space, with additional strategies, including two-pass tagging and multi-class models, which both proved to be beneficial. The results show that the integration of the proposed methods can identify Health Information Portability and Accountability Act (HIPAA) defined PHIs with overall F1-scores of ∼90% and above. Yet, some classes (Profession, Organization) proved again to be challenging given the variability of expressions used to reference given information.


Asunto(s)
Algoritmos , Confidencialidad , Trastornos Mentales/psicología , Health Insurance Portability and Accountability Act , Humanos , Aprendizaje Automático , Estados Unidos
3.
J Biomed Inform ; 58 Suppl: S183-S188, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26133479

RESUMEN

Heart disease is the leading cause of death globally and a significant part of the human population lives with it. A number of risk factors have been recognized as contributing to the disease, including obesity, coronary artery disease (CAD), hypertension, hyperlipidemia, diabetes, smoking, and family history of premature CAD. This paper describes and evaluates a methodology to extract mentions of such risk factors from diabetic clinical notes, which was a task of the i2b2/UTHealth 2014 Challenge in Natural Language Processing for Clinical Data. The methodology is knowledge-driven and the system implements local lexicalized rules (based on syntactical patterns observed in notes) combined with manually constructed dictionaries that characterize the domain. A part of the task was also to detect the time interval in which the risk factors were present in a patient. The system was applied to an evaluation set of 514 unseen notes and achieved a micro-average F-score of 88% (with 86% precision and 90% recall). While the identification of CAD family history, medication and some of the related disease factors (e.g. hypertension, diabetes, hyperlipidemia) showed quite good results, the identification of CAD-specific indicators proved to be more challenging (F-score of 74%). Overall, the results are encouraging and suggested that automated text mining methods can be used to process clinical notes to identify risk factors and monitor progression of heart disease on a large-scale, providing necessary data for clinical and epidemiological studies.


Asunto(s)
Enfermedades Cardiovasculares/epidemiología , Minería de Datos/métodos , Complicaciones de la Diabetes/epidemiología , Registros Electrónicos de Salud/organización & administración , Narración , Procesamiento de Lenguaje Natural , Anciano , Enfermedades Cardiovasculares/diagnóstico , Estudios de Cohortes , Comorbilidad , Seguridad Computacional , Confidencialidad , Complicaciones de la Diabetes/diagnóstico , Femenino , Humanos , Incidencia , Estudios Longitudinales , Masculino , Persona de Mediana Edad , Reconocimiento de Normas Patrones Automatizadas/métodos , Medición de Riesgo/métodos , Semántica , Reino Unido/epidemiología , Vocabulario Controlado
4.
J Biomed Inform ; 58 Suppl: S53-S59, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26210359

RESUMEN

A recent promise to access unstructured clinical data from electronic health records on large-scale has revitalized the interest in automated de-identification of clinical notes, which includes the identification of mentions of Protected Health Information (PHI). We describe the methods developed and evaluated as part of the i2b2/UTHealth 2014 challenge to identify PHI defined by 25 entity types in longitudinal clinical narratives. Our approach combines knowledge-driven (dictionaries and rules) and data-driven (machine learning) methods with a large range of features to address de-identification of specific named entities. In addition, we have devised a two-pass recognition approach that creates a patient-specific run-time dictionary from the PHI entities identified in the first step with high confidence, which is then used in the second pass to identify mentions that lack specific clues. The proposed method achieved the overall micro F1-measures of 91% on strict and 95% on token-level evaluation on the test dataset (514 narratives). Whilst most PHI entities can be reliably identified, particularly challenging were mentions of Organizations and Professions. Still, the overall results suggest that automated text mining methods can be used to reliably process clinical notes to identify personal information and thus providing a crucial step in large-scale de-identification of unstructured data for further clinical and epidemiological studies.


Asunto(s)
Seguridad Computacional , Confidencialidad , Registros Electrónicos de Salud/organización & administración , Narración , Procesamiento de Lenguaje Natural , Reconocimiento de Normas Patrones Automatizadas/métodos , Estudios de Cohortes , Simulación por Computador , Minería de Datos/métodos , Aprendizaje Automático , Modelos Estadísticos , Reino Unido , Vocabulario Controlado
5.
Artif Intell Med ; 151: 102845, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38555848

RESUMEN

BACKGROUND: Electronic health records (EHRs) are a valuable resource for data-driven medical research. However, the presence of protected health information (PHI) makes EHRs unsuitable to be shared for research purposes. De-identification, i.e. the process of removing PHI is a critical step in making EHR data accessible. Natural language processing has repeatedly demonstrated its feasibility in automating the de-identification process. OBJECTIVES: Our study aims to provide systematic evidence on how the de-identification of clinical free text written in English has evolved in the last thirteen years, and to report on the performances and limitations of the current state-of-the-art systems for the English language. In addition, we aim to identify challenges and potential research opportunities in this field. METHODS: A systematic search in PubMed, Web of Science, and the DBLP was conducted for studies published between January 2010 and February 2023. Titles and abstracts were examined to identify the relevant studies. Selected studies were then analysed in-depth, and information was collected on de-identification methodologies, data sources, and measured performance. RESULTS: A total of 2125 publications were identified for the title and abstract screening. 69 studies were found to be relevant. Machine learning (37 studies) and hybrid (26 studies) approaches are predominant, while six studies relied only on rules. The majority of the approaches were trained and evaluated on public corpora. The 2014 i2b2/UTHealth corpus is the most frequently used (36 studies), followed by the 2006 i2b2 (18 studies) and 2016 CEGS N-GRID (10 studies) corpora. CONCLUSION: Earlier de-identification approaches aimed at English were mainly rule and machine learning hybrids with extensive feature engineering and post-processing, while more recent performance improvements are due to feature-inferring recurrent neural networks. Current leading performance is achieved using attention-based neural models. Recent studies report state-of-the-art F1-scores (over 98 %) when evaluated in the manner usually adopted by the clinical natural language processing community. However, their performance needs to be more thoroughly assessed with different measures to judge their reliability to safely de-identify data in a real-world setting. Without additional manually labeled training data, state-of-the-art systems fail to generalise well across a wide range of clinical sub-domains.


Asunto(s)
Registros Electrónicos de Salud , Procesamiento de Lenguaje Natural , Humanos , Aprendizaje Automático
6.
Int J Med Inform ; 164: 104805, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35653828

RESUMEN

BACKGROUND AND OBJECTIVES: The importance of clinical natural language processing (NLP) has increased with the adoption of electronic health records (EHRs). One of the critical tasks in clinical NLP is named entity recognition (NER). Clinical NER in the Serbian language is a severely under-researched area. The few approaches that have been proposed so far are based on rules or machine-learning models with hand-crafted features, while current state-of-the-art models have not been explored. The objective of this paper is to assess the performance of state-of-the-art NER methods on clinical narratives in the Serbian language. MATERIALS AND METHODS: We designed an experimental setup for a comprehensive evaluation of state-of-the-art NER models. The gold standard corpus we used for the evaluation is comprised of discharge summaries from the Clinic for Nephrology at the University Clinical Center of Serbia. The following models were evaluated: conditional random fields (CRF), multilingual transformers (BERT Multilingual and XLM RoBERTa), and long short-term memory (LSTM) recurrent neural networks, and their ensembles. In addition, we investigated the necessity of the pretraining task of transformer based models and the use of pretrained word embeddings with LSTM model. RESULTS: Our results show that individually CRF had the best precision, the pretrained BERT Multilingual model had the best recall values, and the LSTM model had the best F1 score. The best performance was achieved by combining the existing models in a majority voting ensemble with an F1 score of 0.892. The presented results are similar to the inter annotator agreement on our gold standard corpus and are comparable to existing state-of-the-art results for clinical NER reported in literature. CONCLUSION: Existing state-of-the-art models can provide viable results for clinical named entity recognition when applied to languages with the complexity of the Serbian language without major modifications.


Asunto(s)
Registros Electrónicos de Salud , Procesamiento de Lenguaje Natural , Humanos , Aprendizaje Automático , Redes Neurales de la Computación , Serbia
7.
J Biomed Semantics ; 6: 29, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26131352

RESUMEN

BACKGROUND: There are numerous options available to achieve various tasks in bioinformatics, but until recently, there were no tools that could systematically identify mentions of databases and tools within the literature. In this paper we explore the variability and ambiguity of database and software name mentions and compare dictionary and machine learning approaches to their identification. RESULTS: Through the development and analysis of a corpus of 60 full-text documents manually annotated at the mention level, we report high variability and ambiguity in database and software mentions. On a test set of 25 full-text documents, a baseline dictionary look-up achieved an F-score of 46 %, highlighting not only variability and ambiguity but also the extensive number of new resources introduced. A machine learning approach achieved an F-score of 63 % (with precision of 74 %) and 70 % (with precision of 83 %) for strict and lenient matching respectively. We characterise the issues with various mention types and propose potential ways of capturing additional database and software mentions in the literature. CONCLUSIONS: Our analyses show that identification of mentions of databases and tools is a challenging task that cannot be achieved by relying on current manually-curated resource repositories. Although machine learning shows improvement and promise (primarily in precision), more contextual information needs to be taken into account to achieve a good degree of accuracy.

8.
J Am Med Inform Assoc ; 20(5): 859-66, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23605114

RESUMEN

OBJECTIVE: Identification of clinical events (eg, problems, tests, treatments) and associated temporal expressions (eg, dates and times) are key tasks in extracting and managing data from electronic health records. As part of the i2b2 2012 Natural Language Processing for Clinical Data challenge, we developed and evaluated a system to automatically extract temporal expressions and events from clinical narratives. The extracted temporal expressions were additionally normalized by assigning type, value, and modifier. MATERIALS AND METHODS: The system combines rule-based and machine learning approaches that rely on morphological, lexical, syntactic, semantic, and domain-specific features. Rule-based components were designed to handle the recognition and normalization of temporal expressions, while conditional random fields models were trained for event and temporal recognition. RESULTS: The system achieved micro F scores of 90% for the extraction of temporal expressions and 87% for clinical event extraction. The normalization component for temporal expressions achieved accuracies of 84.73% (expression's type), 70.44% (value), and 82.75% (modifier). DISCUSSION: Compared to the initial agreement between human annotators (87-89%), the system provided comparable performance for both event and temporal expression mining. While (lenient) identification of such mentions is achievable, finding the exact boundaries proved challenging. CONCLUSIONS: The system provides a state-of-the-art method that can be used to support automated identification of mentions of clinical events and temporal expressions in narratives either to support the manual review process or as a part of a large-scale processing of electronic health databases.


Asunto(s)
Inteligencia Artificial , Registros Electrónicos de Salud , Almacenamiento y Recuperación de la Información/métodos , Humanos , Procesamiento de Lenguaje Natural , Tiempo , Investigación Biomédica Traslacional
9.
Biomed Inform Insights ; 5(Suppl. 1): 115-24, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22879767

RESUMEN

We describe and evaluate an automated approach used as part of the i2b2 2011 challenge to identify and categorise statements in suicide notes into one of 15 topics, including Love, Guilt, Thankfulness, Hopelessness and Instructions. The approach combines a set of lexico-syntactic rules with a set of models derived by machine learning from a training dataset. The machine learning models rely on named entities, lexical, lexico-semantic and presentation features, as well as the rules that are applicable to a given statement. On a testing set of 300 suicide notes, the approach showed the overall best micro F-measure of up to 53.36%. The best precision achieved was 67.17% when only rules are used, whereas best recall of 50.57% was with integrated rules and machine learning. While some topics (eg, Sorrow, Anger, Blame) prove challenging, the performance for relatively frequent (eg, Love) and well-scoped categories (eg, Thankfulness) was comparatively higher (precision between 68% and 79%), suggesting that automated text mining approaches can be effective in topic categorisation of suicide notes.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA