A distributable German clinical corpus containing cardiovascular clinical routine doctor's letters.
Richter-Pechanski, Phillip; Wiesenbach, Philipp; Schwab, Dominic M; Kiriakou, Christina; He, Mingyang; Allers, Michael M; Tiefenbacher, Anna S; Kunz, Nicola; Martynova, Anna; Spiller, Noemie; Mierisch, Julian; Borchert, Florian; Schwind, Charlotte; Frey, Norbert; Dieterich, Christoph; Geis, Nicolas A.
Affiliation
  • Richter-Pechanski P; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany. phillip.richter-pechanski@med.uni-heidelberg.de.
  • Wiesenbach P; Department of Internal Medicine III, University Hospital Heidelberg, Heidelberg, DE, Germany. phillip.richter-pechanski@med.uni-heidelberg.de.
  • Schwab DM; German Center for Cardiovascular Research (DZHK) - Partner site Heidelberg/Mannheim, Heidelberg, DE, Germany. phillip.richter-pechanski@med.uni-heidelberg.de.
  • Kiriakou C; Informatics for Life, Heidelberg, DE, Germany. phillip.richter-pechanski@med.uni-heidelberg.de.
  • He M; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany.
  • Allers MM; Department of Internal Medicine III, University Hospital Heidelberg, Heidelberg, DE, Germany.
  • Tiefenbacher AS; Informatics for Life, Heidelberg, DE, Germany.
  • Kunz N; Department of Internal Medicine III, University Hospital Heidelberg, Heidelberg, DE, Germany.
  • Martynova A; Department of Internal Medicine III, University Hospital Heidelberg, Heidelberg, DE, Germany.
  • Spiller N; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany.
  • Mierisch J; Department of Internal Medicine III, University Hospital Heidelberg, Heidelberg, DE, Germany.
  • Borchert F; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany.
  • Schwind C; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany.
  • Frey N; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany.
  • Dieterich C; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany.
  • Geis NA; Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg, DE, Germany.
Sci Data; 10(1): 207, 2023 Apr 14.
Article in English | MEDLINE | ID: mdl-37059736
ABSTRACT
We present CARDIODE, the first freely available and distributable large German clinical corpus from the cardiovascular domain. CARDIODE encompasses 500 clinical routine German doctor's letters from Heidelberg University Hospital, which were manually annotated. Our prospective study design complies well with current data protection regulations and allows us to keep the original structure of clinical documents consistent. In order to ease access to our corpus, we manually de-identified all letters. To enable various information extraction tasks, the temporal information in the documents was preserved. We added two high-quality manual annotation layers to CARDIODE: (1) medication information and (2) CDA-compliant section classes. To the best of our knowledge, CARDIODE is the first freely available and distributable German clinical corpus in the cardiovascular domain. In summary, our corpus offers unique opportunities for collaborative and reproducible research on natural language processing models for German clinical texts.

Full text: 1 Collection: 01-international Database: MEDLINE Study type: Guideline / Observational_studies Language: En Journal: Sci Data Year: 2023 Document type: Article Affiliation country: Germany
