Results 1 - 3 of 3
1.
J Biomed Inform ; 69: 203-217, 2017 May.
Article in English | MEDLINE | ID: mdl-28404537

ABSTRACT

OBJECTIVE: To build a comprehensive corpus covering syntactic and semantic annotations of Chinese clinical texts, with corresponding annotation guidelines and methods, and to develop tools trained on the annotated corpus that supply baselines for research on Chinese texts in the clinical domain. MATERIALS AND METHODS: An iterative annotation method was proposed to train annotators and to develop annotation guidelines. Then, using annotation quality assurance measures, a comprehensive corpus was built, containing annotations of part-of-speech (POS) tags, syntactic tags, entities, assertions, and relations. Inter-annotator agreement (IAA) was calculated to evaluate the annotation quality, and a Chinese clinical text processing and information extraction system (CCTPIES) was developed based on the annotated corpus. RESULTS: The syntactic corpus consists of 138 Chinese clinical documents with 47,426 tokens and 2,612 full parsing trees, while the semantic corpus includes 992 documents annotated with 39,511 entities, their assertions, and 7,693 relations. IAA evaluation shows that this comprehensive corpus is of good quality and that the system modules are effective. DISCUSSION: The annotated corpus makes a considerable contribution to natural language processing (NLP) research into Chinese texts in the clinical domain. However, the corpus has a number of limitations. Additional types of clinical text should be introduced to improve corpus coverage, and active learning methods should be used to improve annotation efficiency. CONCLUSIONS: In this study, several annotation guidelines and an annotation method for Chinese clinical texts were proposed, and a comprehensive corpus with its NLP modules was constructed, providing a foundation for further study of applying NLP techniques to Chinese texts in the clinical domain.
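
The abstract does not say which IAA statistic was used. As an illustration only, the sketch below computes Cohen's kappa over two annotators' token-level labels (for example POS tags). The choice of kappa, the function name `cohen_kappa`, and the sample tags are assumptions, not details from the paper.

```python
from collections import Counter
from typing import Sequence


def cohen_kappa(labels_a: Sequence[str], labels_b: Sequence[str]) -> float:
    """Cohen's kappa for two annotators labeling the same tokens.

    Hypothetical helper: the paper may have used a different IAA measure.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and equally long")
    n = len(labels_a)
    # Observed agreement: fraction of tokens given the same label.
    observed = sum(x == y for x, y in zip(labels_a, labels_b)) / n
    # Chance agreement from each annotator's label distribution.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(
        count_a[label] * count_b[label]
        for label in count_a.keys() | count_b.keys()
    ) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label everywhere.
        return 1.0
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    annotator_1 = ["NN", "VV", "NN", "AD"]  # made-up sample tags
    annotator_2 = ["NN", "VV", "NN", "NN"]
    print(round(cohen_kappa(annotator_1, annotator_2), 4))
```

For span annotations such as entities and relations, agreement is often reported as an F1 score between annotators instead, since token-level chance agreement is less meaningful there.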


Subjects
Data Curation , Natural Language Processing , Semantics , China , Data Mining , Humans , Language , Narration
2.
J Biomed Inform ; 58 Suppl: S39-S46, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26315662

ABSTRACT

De-identification is a shared task of the 2014 i2b2/UTHealth challenge. The purpose of this task is to remove protected health information (PHI) from medical records. In this paper, we propose a novel de-identifier, WI-deId, based on conditional random fields (CRFs). A preprocessing module, which tokenizes the medical records using regular expressions and an off-the-shelf tokenizer, is introduced, and three groups of features are extracted to train the de-identifier model. The experiment shows that our system is effective in the de-identification of medical records, achieving a micro-F1 of 0.9232 at the i2b2 strict entity evaluation level.
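
To make the reported metric concrete, here is a minimal sketch of a micro-averaged F1 under strict entity matching, where a predicted PHI span counts as correct only if its start offset, end offset, and type all match a gold span. The `(start, end, type)` tuple representation, the function name `micro_f1_strict`, and the sample data are assumptions for illustration; this is not the official i2b2 evaluation script.

```python
from typing import Dict, Set, Tuple

# Hypothetical entity representation: (start_offset, end_offset, phi_type).
Entity = Tuple[int, int, str]


def micro_f1_strict(
    gold: Dict[str, Set[Entity]], pred: Dict[str, Set[Entity]]
) -> float:
    """Micro-averaged F1 where only exact span-and-type matches count."""
    tp = fp = fn = 0
    # Micro-averaging: pool counts over all documents before computing F1.
    for doc_id in gold.keys() | pred.keys():
        gold_ents = gold.get(doc_id, set())
        pred_ents = pred.get(doc_id, set())
        tp += len(gold_ents & pred_ents)   # exact matches
        fp += len(pred_ents - gold_ents)   # spurious predictions
        fn += len(gold_ents - pred_ents)   # missed gold entities
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    gold = {"doc1": {(0, 4, "NAME"), (10, 20, "DATE")}, "doc2": {(5, 9, "AGE")}}
    # The DATE span in doc1 is off by one character, so it fails strict matching.
    pred = {"doc1": {(0, 4, "NAME"), (10, 19, "DATE")}, "doc2": {(5, 9, "AGE")}}
    print(round(micro_f1_strict(gold, pred), 4))
```

Under strict matching, a boundary error is penalized twice: once as a false positive and once as a false negative.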


Subjects
Computer Security , Confidentiality , Data Mining/methods , Electronic Health Records/organization & administration , Natural Language Processing , Pattern Recognition, Automated/methods , China , Cohort Studies , Data Interpretation, Statistical , Narration , Vocabulary, Controlled
3.
Materials (Basel) ; 14(2), 2021 Jan 13.
Article in English | MEDLINE | ID: mdl-33451146

ABSTRACT

Cu-Ni-Si alloys are widely used in the electrical and electronics industries owing to their excellent electrical conductivity and strength. A suitable addition of Co to Cu-Ni-Si alloys can improve their strength but degrades their electrical conductivity. In this work, Cu-Ni-Co-Si-P-Mg alloys with different Co contents are used to investigate the effects of Co on properties and microstructure. The results show that Co addition leads to the formation of (Ni, Co)2Si precipitates, which are harder to coarsen than δ-Ni2Si during aging. The higher the Co content of the alloy, the smaller the precipitates that form. A threshold Co content divides the studied alloys into two groups. The group of alloys with <1 wt.% Co or a Co/Ni ratio <0.56 has the same aging behavior as the Cu-Ni-Si-P-Mg alloy. In the other group, by contrast, the time to reach peak hardness during aging is clearly delayed, and the electrical conductivity decreases slightly with increasing Co content. This can be attributed to the lower diffusion rate of Co compared with Ni in the Cu matrix. Co addition also inhibits the formation of the P-enriched Ni-P phase in Co-containing alloys during aging. The as-quenched Cu-1.6Ni-1.2Co-0.65Si-0.1P-0.05Mg alloy reaches a hardness of 257 HV and an electrical conductivity of 38.7% IACS after aging at 500 °C for 3 h.
