Your browser doesn't support javascript.
loading
Classification of estrogenic compounds by coupling high content analysis and machine learning algorithms.
Mukherjee, Rajib; Beykal, Burcu; Szafran, Adam T; Onel, Melis; Stossi, Fabio; Mancini, Maureen G; Lloyd, Dillon; Wright, Fred A; Zhou, Lan; Mancini, Michael A; Pistikopoulos, Efstratios N.
Affiliation
  • Mukherjee R; Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America.
  • Beykal B; Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America.
  • Szafran AT; Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, United States of America.
  • Onel M; Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, United States of America.
  • Stossi F; Texas A&M Energy Institute, Texas A&M University, College Station, TX, United States of America.
  • Mancini MG; Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, United States of America.
  • Lloyd D; Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, United States of America.
  • Wright FA; GCC Center for Advanced Microscopy and Image Informatics, Houston, TX, United States of America.
  • Zhou L; Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX, United States of America.
  • Mancini MA; GCC Center for Advanced Microscopy and Image Informatics, Houston, TX, United States of America.
  • Pistikopoulos EN; Bioinformatics Research Center, Center for Human Health and the Environment, Department of Statistics, North Carolina State University, Raleigh, NC, United States of America.
PLoS Comput Biol ; 16(9): e1008191, 2020 09.
Article in En | MEDLINE | ID: mdl-32970665
ABSTRACT
Environmental toxicants affect human health in various ways. Of the thousands of chemicals present in the environment, those with adverse effects on the endocrine system are referred to as endocrine-disrupting chemicals (EDCs). Here, we focused on a subclass of EDCs that impacts the estrogen receptor (ER), a pivotal transcriptional regulator in health and disease. Estrogenic activity of compounds can be measured by many in vitro or cell-based high throughput assays that record various endpoints from large pools of cells, and increasingly at the single-cell level. To simultaneously capture multiple mechanistic ER endpoints in individual cells that are affected by EDCs, we previously developed a sensitive high throughput/high content imaging assay that is based upon a stable cell line harboring a visible multicopy ER responsive transcription unit and expressing a green fluorescent protein (GFP) fusion of ER. High content analysis generates voluminous multiplex data comprised of minable features that describe numerous mechanistic endpoints. In this study, we present a machine learning pipeline for rapid, accurate, and sensitive assessment of the endocrine-disrupting potential of benchmark chemicals based on data generated from high content analysis. The multidimensional imaging data was used to train a classification model to ultimately predict the impact of unknown compounds on the ER, either as agonists or antagonists. To this end, both linear logistic regression and nonlinear Random Forest classifiers were benchmarked and evaluated for predicting the estrogenic activity of unknown compounds. Furthermore, through feature selection, data visualization, and model discrimination, the most informative features were identified for the classification of ER agonists/antagonists. The results of this data-driven study showed that highly accurate and generalized classification models with a minimum number of features can be constructed without loss of generality, where these machine learning models serve as a means for rapid mechanistic/phenotypic evaluation of the estrogenic potential of many chemicals.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Estrogens / Machine Learning Type of study: Prognostic_studies Limits: Humans Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2020 Document type: Article Affiliation country: Estados Unidos Publication country: EEUU / ESTADOS UNIDOS / ESTADOS UNIDOS DA AMERICA / EUA / UNITED STATES / UNITED STATES OF AMERICA / US / USA

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Estrogens / Machine Learning Type of study: Prognostic_studies Limits: Humans Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2020 Document type: Article Affiliation country: Estados Unidos Publication country: EEUU / ESTADOS UNIDOS / ESTADOS UNIDOS DA AMERICA / EUA / UNITED STATES / UNITED STATES OF AMERICA / US / USA