Your browser doesn't support javascript.
loading
Temporal condition pattern mining in large, sparse electronic health record data: A case study in characterizing pediatric asthma.
Campbell, Elizabeth A; Bass, Ellen J; Masino, Aaron J.
Affiliation
  • Campbell EA; Department of Information Science, College of Computing & Informatics, Drexel University, Philadelphia, Pennsylvania, USA.
  • Bass EJ; Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, USA.
  • Masino AJ; Department of Information Science, College of Computing & Informatics, Drexel University, Philadelphia, Pennsylvania, USA.
J Am Med Inform Assoc ; 27(4): 558-566, 2020 04 01.
Article in En | MEDLINE | ID: mdl-32049282
ABSTRACT

OBJECTIVE:

This study introduces a temporal condition pattern mining methodology to address the sparse nature of coded condition concept utilization in electronic health record data. As a validation study, we applied this method to reveal condition patterns surrounding an initial diagnosis of pediatric asthma. MATERIALS AND

METHODS:

The SPADE (Sequential PAttern Discovery using Equivalence classes) algorithm was used to identify common temporal condition patterns surrounding the initial diagnosis of pediatric asthma in a study population of 71 824 patients from the Children's Hospital of Philadelphia. SPADE was applied to a dataset with diagnoses coded using International Classification of Diseases (ICD) concepts and separately to a dataset with the ICD codes mapped to their corresponding expanded diagnostic clusters (EDCs). Common temporal condition patterns surrounding the initial diagnosis of pediatric asthma ascertained by SPADE from both the ICD and EDC datasets were compared.

RESULTS:

SPADE identified 36 unique diagnoses in the mapped EDC dataset, whereas only 19 were recognized in the ICD dataset. Temporal trends in condition diagnoses ascertained from the EDC data were not discoverable in the ICD dataset.

DISCUSSION:

Mining frequent temporal condition patterns from large electronic health record datasets may reveal previously unknown associations between diagnoses that could inform future research into causation or other relationships. Mapping sparsely coded medical concepts into homogenous groups was essential to discovering potentially useful information from our dataset.

CONCLUSIONS:

We expect that the presented methodology is applicable to the study of diagnostic trajectories for other clinical conditions and can be extended to study temporal patterns of other coded medical concepts such as medications and procedures.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Asthma / Algorithms / Pattern Recognition, Automated / Electronic Health Records / Data Mining Type of study: Prognostic_studies Limits: Child / Humans Language: En Journal: J Am Med Inform Assoc Journal subject: INFORMATICA MEDICA Year: 2020 Document type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Asthma / Algorithms / Pattern Recognition, Automated / Electronic Health Records / Data Mining Type of study: Prognostic_studies Limits: Child / Humans Language: En Journal: J Am Med Inform Assoc Journal subject: INFORMATICA MEDICA Year: 2020 Document type: Article Affiliation country: United States