Your browser doesn't support javascript.
loading
Analysis of Half a Billion Datapoints Across Ten Machine-Learning Algorithms Identifies Key Elements Associated With Insulin Transcription in Human Pancreatic Islet Cells.
Wong, Wilson K M; Thorat, Vinod; Joglekar, Mugdha V; Dong, Charlotte X; Lee, Hugo; Chew, Yi Vee; Bhave, Adwait; Hawthorne, Wayne J; Engin, Feyza; Pant, Aniruddha; Dalgaard, Louise T; Bapat, Sharda; Hardikar, Anandwardhan A.
Afiliação
  • Wong WKM; Diabetes and Islet Biology Group, School of Medicine, Western Sydney University, Campbelltown, NSW, Australia.
  • Thorat V; Healthcare Analytics, AlgoAnalytics, Pune, India.
  • Joglekar MV; Diabetes and Islet Biology Group, School of Medicine, Western Sydney University, Campbelltown, NSW, Australia.
  • Dong CX; Diabetes and Islet Biology Group, School of Medicine, Western Sydney University, Campbelltown, NSW, Australia.
  • Lee H; Department of Biomolecular Chemistry, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States.
  • Chew YV; Centre for Transplant and Renal Research, Westmead Institute for Medical Research, University of Sydney, Westmead, NSW, Australia.
  • Bhave A; Healthcare Analytics, AlgoAnalytics, Pune, India.
  • Hawthorne WJ; Centre for Transplant and Renal Research, Westmead Institute for Medical Research, University of Sydney, Westmead, NSW, Australia.
  • Engin F; Department of Biomolecular Chemistry, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States.
  • Pant A; Division of Endocrinology, Diabetes & Metabolism, Department of Medicine, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, United States.
  • Dalgaard LT; Healthcare Analytics, AlgoAnalytics, Pune, India.
  • Bapat S; Department of Science and Environment, Roskilde University, Roskilde, Denmark.
  • Hardikar AA; Healthcare Analytics, AlgoAnalytics, Pune, India.
Front Endocrinol (Lausanne) ; 13: 853863, 2022.
Article em En | MEDLINE | ID: mdl-35399953
ABSTRACT
Machine learning (ML)-workflows enable unprejudiced/robust evaluation of complex datasets. Here, we analyzed over 490,000,000 data points to compare 10 different ML-workflows in a large (N=11,652) training dataset of human pancreatic single-cell (sc-)transcriptomes to identify genes associated with the presence or absence of insulin transcript(s). Prediction accuracy/sensitivity of each ML-workflow was tested in a separate validation dataset (N=2,913). Ensemble ML-workflows, in particular Random Forest ML-algorithm delivered high predictive power (AUC=0.83) and sensitivity (0.98), compared to other algorithms. The transcripts identified through these analyses also demonstrated significant correlation with insulin in bulk RNA-seq data from human islets. The top-10 features, (including IAPP, ADCYAP1, LDHA and SST) common to the three Ensemble ML-workflows were significantly dysregulated in scRNA-seq datasets from Ire-1αß-/- mice that demonstrate dedifferentiation of pancreatic ß-cells in a model of type 1 diabetes (T1D) and in pancreatic single cells from individuals with type 2 Diabetes (T2D). Our findings provide direct comparison of ML-workflows in big data analyses, identify key elements associated with insulin transcription and provide workflows for future analyses.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Ilhotas Pancreáticas / Diabetes Mellitus Tipo 2 Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Animals / Humans Idioma: En Revista: Front Endocrinol (Lausanne) Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Austrália

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Ilhotas Pancreáticas / Diabetes Mellitus Tipo 2 Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Animals / Humans Idioma: En Revista: Front Endocrinol (Lausanne) Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Austrália