Your browser doesn't support javascript.
loading
Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches.
Miao, Shumei; Xu, Tingyu; Wu, Yonghui; Xie, Hui; Wang, Jingqi; Jing, Shenqi; Zhang, Yaoyun; Zhang, Xiaoliang; Yang, Yinshuang; Zhang, Xin; Shan, Tao; Wang, Li; Xu, Hua; Wang, Shui; Liu, Yun.
Afiliação
  • Miao S; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China.
  • Xu T; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China.
  • Wu Y; School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Xie H; Department of Breast Diseases, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China.
  • Wang J; School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Jing S; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China.
  • Zhang Y; School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Zhang X; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China.
  • Yang Y; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China.
  • Zhang X; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China.
  • Shan T; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China.
  • Wang L; Department of Medical Informatics, Medical School, Nantong University, Nantong, Jiangsu, China.
  • Xu H; School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Wang S; Department of Breast Diseases, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China. Electronic address: ws0801@hotmail.com.
  • Liu Y; Department of Information, The First Affiliated Hospital of Nanjing Medical University & Jiangsu Province Hospital, Nanjing, Jiangsu, China; Institute of Medical Informatics and Management, Nanjing Medical University, Nanjing, Jiangsu, China. Electronic address: liuyun@njmu.edu.cn.
Int J Med Inform ; 119: 17-21, 2018 11.
Article em En | MEDLINE | ID: mdl-30342682
BACKGROUND: The wide adoption of electronic health record systems (EHRs) in hospitals in China has made large amounts of data available for clinical research including breast cancer. Unfortunately, much of detailed clinical information is embedded in clinical narratives e.g., breast radiology reports. The American College of Radiology (ACR) has developed a Breast Imaging Reporting and Data System (BI-RADS) to standardize the clinical findings from breast radiology reports. OBJECTIVES: This study aims to develop natural language processing (NLP) methods to extract BI-RADS findings from breast ultrasound reports in Chinese, thus to support clinical operation and breast cancer research in China. METHODS: We developed and compared three different types of NLP approaches, including a rule-based method, a traditional machine learning-based method using the Conditional Random Fields (CRF) algorithm, and deep learning-based approaches, to extract all BI-RADS finding categories from breast ultrasound reports in Chinese. RESULTS: Using a manually annotated dataset containing 540 reports, our evaluation shows that the deep learning-based method achieved the best F1-score of 0.904, when compared with rule-based and CRF-based approaches (0.848 and 0.881 respectively). CONCLUSIONS: This is the first study that applies deep learning technologies to BI-RADS findings extraction in Chinese breast ultrasound reports, demonstrating its potential on enabling international collaborations on breast cancer research.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies Limite: Female / Humans País/Região como assunto: Asia Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies Limite: Female / Humans País/Região como assunto: Asia Idioma: En Ano de publicação: 2018 Tipo de documento: Article