Your browser doesn't support javascript.
loading
Dense phenotyping from electronic health records enables machine learning-based prediction of preterm birth.
Abraham, Abin; Le, Brian; Kosti, Idit; Straub, Peter; Velez-Edwards, Digna R; Davis, Lea K; Newton, J M; Muglia, Louis J; Rokas, Antonis; Bejan, Cosmin A; Sirota, Marina; Capra, John A.
Afiliação
  • Abraham A; Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA.
  • Le B; Vanderbilt University Medical Center, Vanderbilt University, Nashville, TN, 37232, USA.
  • Kosti I; Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA.
  • Straub P; Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA.
  • Velez-Edwards DR; Department of Pediatrics, University of California, San Francisco, San Francisco, CA, USA.
  • Davis LK; Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA.
  • Newton JM; Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
  • Muglia LJ; Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA.
  • Rokas A; Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.
  • Bejan CA; Department of Obstetrics and Gynecology, Vanderbilt University Medical Center, Nashville, TN, USA.
  • Sirota M; Vanderbilt Genetics Institute, Vanderbilt University, Nashville, TN, 37235, USA.
  • Capra JA; Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA.
BMC Med ; 20(1): 333, 2022 09 28.
Article em En | MEDLINE | ID: mdl-36167547
ABSTRACT

BACKGROUND:

Identifying pregnancies at risk for preterm birth, one of the leading causes of worldwide infant mortality, has the potential to improve prenatal care. However, we lack broadly applicable methods to accurately predict preterm birth risk. The dense longitudinal information present in electronic health records (EHRs) is enabling scalable and cost-efficient risk modeling of many diseases, but EHR resources have been largely untapped in the study of pregnancy.

METHODS:

Here, we apply machine learning to diverse data from EHRs with 35,282 deliveries to predict singleton preterm birth.

RESULTS:

We find that machine learning models based on billing codes alone can predict preterm birth risk at various gestational ages (e.g., ROC-AUC = 0.75, PR-AUC = 0.40 at 28 weeks of gestation) and outperform comparable models trained using known risk factors (e.g., ROC-AUC = 0.65, PR-AUC = 0.25 at 28 weeks). Examining the patterns learned by the model reveals it stratifies deliveries into interpretable groups, including high-risk preterm birth subtypes enriched for distinct comorbidities. Our machine learning approach also predicts preterm birth subtypes (spontaneous vs. indicated), mode of delivery, and recurrent preterm birth. Finally, we demonstrate the portability of our approach by showing that the prediction models maintain their accuracy on a large, independent cohort (5978 deliveries) from a different healthcare system.

CONCLUSIONS:

By leveraging rich phenotypic and genetic features derived from EHRs, we suggest that machine learning algorithms have great potential to improve medical care during pregnancy. However, further work is needed before these models can be applied in clinical settings.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Nascimento Prematuro Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Limite: Female / Humans / Newborn / Pregnancy Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Nascimento Prematuro Tipo de estudo: Diagnostic_studies / Prognostic_studies / Risk_factors_studies Limite: Female / Humans / Newborn / Pregnancy Idioma: En Ano de publicação: 2022 Tipo de documento: Article