A Machine Learning Classifier for Assigning Individual Patients With Systemic Sclerosis to Intrinsic Molecular Subsets.
Franks, Jennifer M; Martyanov, Viktor; Cai, Guoshuai; Wang, Yue; Li, Zhenghui; Wood, Tammara A; Whitfield, Michael L.
Affiliation
  • Franks JM; Geisel School of Medicine at Dartmouth, Department of Molecular and Systems Biology, Hanover and Lebanon, New Hampshire.
  • Martyanov V; Geisel School of Medicine at Dartmouth, Hanover, New Hampshire.
  • Cai G; Arnold School of Public Health at University of South Carolina, Columbia.
  • Wang Y; Geisel School of Medicine at Dartmouth, Hanover, New Hampshire.
  • Li Z; Geisel School of Medicine at Dartmouth, Hanover, New Hampshire.
  • Wood TA; Geisel School of Medicine at Dartmouth, Hanover, New Hampshire.
  • Whitfield ML; Geisel School of Medicine at Dartmouth, Department of Molecular and Systems Biology, Hanover and Lebanon, New Hampshire.
Arthritis Rheumatol; 71(10): 1701-1710, 2019 Oct.
Article in En | MEDLINE | ID: mdl-30920766
ABSTRACT

OBJECTIVE:

High-throughput gene expression profiling of tissue samples from patients with systemic sclerosis (SSc) has identified 4 "intrinsic" gene expression subsets: inflammatory, fibroproliferative, normal-like, and limited. Prior methods required agglomerative clustering of many samples. In order to classify individual patients in clinical trials or for diagnostic purposes, supervised methods that can assign single samples to molecular subsets are required. We undertook this study to introduce a novel machine learning classifier as a robust, accurate intrinsic subset predictor.

METHODS:

Three independent gene expression cohorts were curated and merged to create a data set covering 297 skin biopsy samples from 102 unique patients and controls, which was used to train a machine learning algorithm. We performed external validation using 3 independent SSc cohorts, including a gene expression data set generated by an independent laboratory on a different microarray platform. In total, 413 skin biopsy samples from 213 individuals were analyzed in the training and testing cohorts.

RESULTS:

Repeated cross-fold validation identified consistent and discriminative markers using multinomial elastic net, performing with an average classification accuracy of 87.1% with high sensitivity and specificity. In external validation, the classifier achieved an average accuracy of 85.4%. Reanalyzing data from a previous study, we identified subsets of patients that represent the canonical inflammatory, fibroproliferative, and normal-like subsets.
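
The accuracy, sensitivity, and specificity quoted above are standard multiclass metrics. As a minimal sketch (not the authors' code), the snippet below shows how they could be computed one-vs-rest for the four intrinsic subsets. The function name `per_class_metrics` and the toy labels are hypothetical and invented for illustration; they are not data from the study.

```python
def per_class_metrics(true_labels, predicted_labels, classes):
    """Return overall accuracy and one-vs-rest sensitivity/specificity per class."""
    pairs = list(zip(true_labels, predicted_labels))
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    metrics = {}
    for c in classes:
        # Treat class c as "positive" and every other subset as "negative".
        tp = sum(t == c and p == c for t, p in pairs)
        fn = sum(t == c and p != c for t, p in pairs)
        fp = sum(t != c and p == c for t, p in pairs)
        tn = sum(t != c and p != c for t, p in pairs)
        metrics[c] = {
            "sensitivity": tp / (tp + fn) if tp + fn else float("nan"),
            "specificity": tn / (tn + fp) if tn + fp else float("nan"),
        }
    return accuracy, metrics


SUBSETS = ["inflammatory", "fibroproliferative", "normal-like", "limited"]

# Toy labels, invented for illustration only (two samples per subset).
true_labels = [
    "inflammatory", "inflammatory",
    "fibroproliferative", "fibroproliferative",
    "normal-like", "normal-like",
    "limited", "limited",
]
predicted_labels = [
    "inflammatory", "inflammatory",
    "fibroproliferative", "inflammatory",
    "normal-like", "normal-like",
    "limited", "normal-like",
]

accuracy, metrics = per_class_metrics(true_labels, predicted_labels, SUBSETS)
print(f"accuracy = {accuracy:.3f}")
for subset in SUBSETS:
    m = metrics[subset]
    print(f"{subset}: sensitivity = {m['sensitivity']:.3f}, "
          f"specificity = {m['specificity']:.3f}")
```

In a repeated cross-validation setting, these per-fold values would typically be averaged across folds and repeats to give a single summary figure; how the study aggregated them is not stated in the abstract.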

CONCLUSION:

We developed a highly accurate classifier for SSc molecular subsets for individual patient samples. The method can be used in SSc clinical trials to identify an intrinsic subset on individual samples. Our method provides a robust data-driven approach to aid clinical decision-making and interpretation of heterogeneous molecular information in SSc patients.
Subjects

Full text: 1 Database: MEDLINE Main subject: Scleroderma, Systemic / Transcriptome / Supervised Machine Learning Language: En Publication year: 2019 Document type: Article
