Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
1.
AMIA Annu Symp Proc ; 2021: 910-919, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-35308904

RESUMO

Finding concepts in large clinical ontologies can be challenging when queries use different vocabularies. A search algorithm that overcomes this problem is useful in applications such as concept normalisation and ontology matching, where concepts can be referred to in different ways, using different synonyms. In this paper, we present a deep learning based approach to build a semantic search system for large clinical ontologies. We propose a Triplet-BERT model and a method that generates training data directly from the ontologies. The model is evaluated using five real benchmark data sets and the results show that our approach achieves high results on both free text to concept and concept to concept searching tasks, and outperforms all baseline methods.


Assuntos
Ontologias Biológicas , Semântica , Algoritmos , Humanos , Vocabulário , Vocabulário Controlado
2.
Stud Health Technol Inform ; 266: 136-141, 2019 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-31397314

RESUMO

Clinical terminologies play an essential role in enabling semantic interoperability between medical records. However, existing terminologies have several issues that impact data quality, such as content gaps and slow updates. In this study we explore the suitability of existing, community-driven resources, specifically Wikipedia, as a potential source to bootstrap an open clinical terminology, in terms of content coverage. In order to establish the extent of the coverage, a team of expert clinical terminologists manually mapped a clinically-relevant subset of SNOMED CT to Wikipedia articles. The results show that approximately 80% of the concepts are covered by Wikipedia. Most concepts that do not have a direct match in Wikipedia are composable from multiple articles. These findings are encouraging and suggest that it should be possible to bootstrap an open clinical terminology from Wikipedia.


Assuntos
Prontuários Médicos , Systematized Nomenclature of Medicine
3.
AMIA Annu Symp Proc ; 2018: 807-816, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30815123

RESUMO

Computer-assisted (diagnostic) coding (CAC) aims to improve the operational productivity and accuracy of clinical coders. The level of accuracy, especially for a wide range of complex and less prevalent clinical cases, remains an open research problem. This study investigates this problem on a broad spectrum of diagnostic codes and, in particular, investigates the effectiveness of utilising SNOMED CT for ICD-10 diagnosis coding. Hospital progress notes were used to provide the narrative rich electronic patient records for the investigation. A natural language processing (NLP) approach using mappings between SNOMED CT and ICD-10-AM (Australian Modification) was used to guide the coding. The proposed approach achieved 54.1% sensitivity and 70.2% positive predictive value. Given the complexity of the task, this was encouraging given the simplicity of the approach and what was projected as possible from a manual diagnosis code validation study (76.3% sensitivity). The results show the potential for advanced NLP-based approaches that leverage SNOMED CT to ICD-10 mapping for hospital in-patient coding.


Assuntos
Codificação Clínica/métodos , Classificação Internacional de Doenças , Processamento de Linguagem Natural , Systematized Nomenclature of Medicine , Austrália , Registros Eletrônicos de Saúde , Hospitais , Humanos , Unified Medical Language System
4.
BMC Med Inform Decis Mak ; 15: 53, 2015 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-26174442

RESUMO

BACKGROUND: Death certificates provide an invaluable source for mortality statistics which can be used for surveillance and early warnings of increases in disease activity and to support the development and monitoring of prevention or response strategies. However, their value can be realised only if accurate, quantitative data can be extracted from death certificates, an aim hampered by both the volume and variable nature of certificates written in natural language. This study aims to develop a set of machine learning and rule-based methods to automatically classify death certificates according to four high impact diseases of interest: diabetes, influenza, pneumonia and HIV. METHODS: Two classification methods are presented: i) a machine learning approach, where detailed features (terms, term n-grams and SNOMED CT concepts) are extracted from death certificates and used to train a set of supervised machine learning models (Support Vector Machines); and ii) a set of keyword-matching rules. These methods were used to identify the presence of diabetes, influenza, pneumonia and HIV in a death certificate. An empirical evaluation was conducted using 340,142 death certificates, divided between training and test sets, covering deaths from 2000-2007 in New South Wales, Australia. Precision and recall (positive predictive value and sensitivity) were used as evaluation measures, with F-measure providing a single, overall measure of effectiveness. A detailed error analysis was performed on classification errors. RESULTS: Classification of diabetes, influenza, pneumonia and HIV was highly accurate (F-measure 0.96). More fine-grained ICD-10 classification effectiveness was more variable but still high (F-measure 0.80). The error analysis revealed that word variations as well as certain word combinations adversely affected classification. In addition, anomalies in the ground truth likely led to an underestimation of the effectiveness. CONCLUSIONS: The high accuracy and low cost of the classification methods allow for an effective means for automatic and real-time surveillance of diabetes, influenza, pneumonia and HIV deaths. In addition, the methods are generally applicable to other diseases of interest and to other sources of medical free-text besides death certificates.


Assuntos
Classificação , Atestado de Óbito , Monitoramento Epidemiológico , Aprendizado de Máquina , Humanos , New South Wales
5.
J Biomed Inform ; 55: 73-81, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-25817970

RESUMO

CSIRO Adverse Drug Event Corpus (Cadec) is a new rich annotated corpus of medical forum posts on patient-reported Adverse Drug Events (ADEs). The corpus is sourced from posts on social media, and contains text that is largely written in colloquial language and often deviates from formal English grammar and punctuation rules. Annotations contain mentions of concepts such as drugs, adverse effects, symptoms, and diseases linked to their corresponding concepts in controlled vocabularies, i.e., SNOMED Clinical Terms and MedDRA. The quality of the annotations is ensured by annotation guidelines, multi-stage annotations, measuring inter-annotator agreement, and final review of the annotations by a clinical terminologist. This corpus is useful for studies in the area of information extraction, or more generally text mining, from social media to detect possible adverse drug reactions from direct patient reports. The corpus is publicly available at https://data.csiro.au.(1).


Assuntos
Sistemas de Notificação de Reações Adversas a Medicamentos/organização & administração , Informação de Saúde ao Consumidor/organização & administração , Mineração de Dados/métodos , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos/classificação , Mídias Sociais/organização & administração , Vocabulário Controlado , Conjuntos de Dados como Assunto/estatística & dados numéricos , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos/epidemiologia , Guias como Assunto , Humanos , Aprendizado de Máquina , Processamento de Linguagem Natural , Mídias Sociais/classificação , Terminologia como Assunto
6.
Stud Health Technol Inform ; 178: 111-6, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22797028

RESUMO

Clinical trial data have historically been implemented using relational databases. While this has expedited the dissemination of data among partners, it has hindered on the ability to swiftly query the data by relying on monolithic tables. This paper outlines a project that investigates the semantic enrichment of a large-scale longitudinal clinical trial, the AIBL study, by reusing entities from existing ontologies. The implication of the semantic enrichment of the AIBL study is that it is possible to query the data more effectively and efficiently. We are now able to implement our model and focus on an end-to-end data capture and analysis pipeline to query and visualise clinical trial data. The main contribution of this paper is a discussion of the methodology to semantically enrich clinical trial data using entities from existing ontologies.


Assuntos
Ensaios Clínicos como Assunto , Semântica , Humanos , Estudos Longitudinais , Systematized Nomenclature of Medicine
7.
Med J Aust ; 194(4): S8-10, 2011 Feb 21.
Artigo em Inglês | MEDLINE | ID: mdl-21401491

RESUMO

Emergency departments around Australia use a range of software to capture data on patients' reason for encounter, presenting problem and diagnosis. The data collected are mainly based on descriptions and codes of the International Classification of Diseases, 10th revision, Australian modification (ICD-10-AM), with each emergency department having a tailored list of terms. The National E-Health Transition Authority is introducing a standard clinical terminology, the Systematized Nomenclature of Medicine--Clinical Terms (SNOMED CT), as one of the building blocks of an e-health infrastructure in Australia. The Australian e-Health Research Centre has developed a software platform, Snapper, which facilitates mapping of existing clinical terms to the SNOMED CT terminology. Using the Snapper software, reference sets of terms for emergency departments are being developed, based on the Australian version of SNOMED CT (SNOMED CT-AU). Existing software systems need to be able to implement these reference sets to support standardised recording of data at the point of care. As the terms collected will be part of a larger terminology, they will be useful for patients' admission and discharge summaries and for computerised clinical decision making. Mapping existing sets of clinical terms to a national emergency department SNOMED CT reference set will facilitate consistency between emergency department data collections and improve the usefulness of the data for clinical and analytical purposes.


Assuntos
Bases de Dados Factuais , Serviço Hospitalar de Emergência/estatística & dados numéricos , Systematized Nomenclature of Medicine , Austrália , Humanos , Classificação Internacional de Doenças , Melhoria de Qualidade , Valores de Referência
8.
Health Inf Manag ; 34(2): 54-6, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-18239215

RESUMO

This short paper highlights some new work being performed at the National Centre for Classification in Health (NCCH), relating newly created term sets developed for specific purposes to existing reference terminologies and classifications such as SNOMED CT and ICD-10-AM. It describes some of the inherent difficulties experienced by the NCCH team in interpreting terms in the term set and therefore in locating equivalent concepts in reference terminologies and classifications, in the absence of a context with which to associate each term. Also examined is the effect that a person's background and past experience has on their understanding and interpretation of clinical terms and how this results in inconsistent "world views".


Assuntos
Controle de Formulários e Registros/classificação , Classificação Internacional de Doenças , Prontuários Médicos/classificação , Systematized Nomenclature of Medicine , Humanos , Semântica , Terminologia como Assunto
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA